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O ' We study first passage percolation on the configuration model. Assuming that each edge has an 

independent exponentially distributed edge weight, we derive explicit distributional asymptotics for 
the minimum weight between two randomly chosen connected vertices in the network, as well as for 
the number of edges on the least weight path, the so-called hopcount. 

We analyze the configuration model with degree power-law exponent r > 2, in which the degrees are 
assumed to be i.i.d. with a tail distribution which is cither of power-law form with exponent r — 1 > 1, 
or has even thinner tails (r = oo). In this model, the degrees have a finite first moment, while the 
variance is finite for r > 3, but infinite for r € (2,3). 
*Jm ' We prove a central limit theorem for the hopcount, with asymptotically equal means and variances 

equal to alogn, where a G (0, 1) for r <G (2, 3), while a > 1 for r > 3. Here n denotes the size of the 
graph. For r € (2,3), it is known that the graph distance between two randomly chosen connected 
vertices is proportional to log log n |25j . i.e., distances are ultra small. Thus, the addition of edge 
weights causes a marked change in the geometry of the network. We further study the weight of the 
least weight path, and prove convergence in distribution of an appropriately centered version. 
• This study continues the program initiated in [5] of showing that log n is the correct scaling for the 

(f) \ hopcount under i.i.d. edge disorder, even if the graph distance between two randomly chosen vertices 

is of much smaller order. The case of infinite mean degrees (r € [1,2)) is studied in [B], where it is 
| proved that the hopcount remains uniformly bounded and converges in distribution. 

rn 

Key words: Flows, random graph, first passage percolation, hopcount, central limit theorem, coupling 
. to continuous-time branching processes, universality. 
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1 Introduction 

The general study of real-world networks has seen a tremendous growth in the last few years. This growth 
occurred both at an empirical level of obtaining data on networks such as the Internet, transportation 
networks, such as rail and road networks, and biochemical networks, such as gene regulatory networks, 
as well as at a theoretical level in the understanding of the properties of various mathematical models for 
these networks. 

We are interested in one specific theoretical aspect of the above vast and expanding field. The setting 
is as follows: Consider a transportation network whose main aim is to transport flow between various 
vertices in the network via the available edges. At the very basic level there are two crucial elements 
which affect the flow carrying capabilities and delays experienced by vertices in the network: 
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(a) The actual graph topology, such as the density of edges and existence of short paths between 
vertices in the graph distance. In this context there has been an enormous amount of interest in the 
concept of small-world networks where the typical graph distance between vertices in the network is of 
order logn or even smaller. Indeed, for many of the mathematical models used to model real- world 
transmission networks, such as the Internet, the graph distance can be of order much smaller than order 
logn. See e.g. [13|, I25j. where for the configuration model with degree exponent r G (2, 3), the remarkable 
result that the graph distance between typical vertices is of order log log n is proved. In this case, we say 
that the graph is ultra small, a phrase invented in [13]. Similar results have appeared for related models 
in \16 \ 133]. The configuration model is described in more detail in Section [2j For introductions to 
scale- free random graphs, we refer to the monographs |12l I17j . for surveys of classical random graphs 
focussing on the Erdos-Renyi random graph, see [8| [30] . 

(b) The second factor which plays a crucial role is the edge weight or cost structure of the graph, which 
can be thought of as representing actual economic costs or congestion costs across edges. Edge weights 
being identically equal to 1 gives us back the graph geometry. What can be said when the edge costs have 
some other behavior? The main aim of this study is to understand what happens when each edge is given 
an independent edge cost with mean 1. For simplicity, we have assumed that the distribution of edge costs 
is exponentially with mean 1 (Exp(l)), leading to first passage percolation on the graph involved. First 
passage percolation with exponential weights has received substantial attention (see [5j l22 | |23" 1 \26 [ l27 l 137] ) . 
in particular on the complete graph, and, more recently, also on Erdos-Renyi random graphs. However, 
particularly the relation to the scale-free nature of the underlying random graph and the behavior of first 
passage percolation on it has not yet been investigated. 

In this paper, we envisage a situation where the edge weights represent actual economic costs, so 
that all flow is routed through minimal weight paths. The actual time delay experienced by vertices in 
the network is given by the number of edges on this least cost path or hopcount H n . Thus, for two 
typical vertices 1 and 2 in the network, it is important to understand both the minimum weight W n of 
transporting flow between two vertices as well as the hopcount H n or the number of edges on this minimal 
weight path. What we shall see is the following universal behavior: 

Even if the graph topology is of ultra-small nature, the addition of random edge weights causes 
a complete change in the geometry and, in particular, the number of edges on the minimal 
weight path between two vertices increases to 0(logn). 

Here we write a n = ®(b n ) if there exist positive constants c and C, such that, for all n, we have cb n < 
On < Cb n . For the precise mathematical results we refer to Sectional We shall see that a remarkably 
universal picture emerges, in the sense that for each r > 2, the hopcount satisfies a central limit theorem 
(CLT) with asymptotically equal mean and variance equal to a logn, where a G (0,1) for r G (2,3), 
while a > 1 for r > 3. The parameter a is the only feature which is left from the randomness of the 
underlying random graph, and a is a simple function of r for r G (2, 3), and of the average forward degree 
for r > 3. This type of universality is reminiscent of that of simple random walk, which, appropriately 
scaled, converges to Brownian motion, and the parameters needed for the Brownian limit are only the 
mean and variance of the step-size. Interestingly, for the Internet hopcount, measurements show that the 
hopcount is close to a normal distribution with equal mean and variance (see e.g., [36])) and it would be 
of interest to investigate whether first passage percolation on a random graph can be used as a model for 
the Internet hopcount. 

This paper is part of the program initiated in [5] to rigorously analyze the asymptotics of distances 
and weights of shortest-weigh paths in random graph models under the addition of edge weights. In 
this paper, we rigorously analyze the case of the configuration model with degree exponent r > 2, the 
conceptually important case in practice, since the degree exponent of a wide variety of real- world networks 
is conjectured to be in this interval. In [6], we investigate the case r G [1,2), where the first moment of 
the degrees is infinite and we observe entirely different behavior of the hopcount H n . 
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2 Notation and definitions 



We are interested in constructing a random graph on n vertices. Given a degree sequence, namely a 
sequence of n positive integers d = {d\, . . . , d n ) with Y17=i assumed to be even, the configuration 
model (CM) on n vertices with degree sequence d is constructed as follows: 

Start with n vertices and di stubs or half-edges adjacent to vertex i. The graph is constructed by 
randomly pairing each stub to some other stub to form edges. Let 

n 

l n = Y J d i (2.1) 

i=i 

denote the total degree. Number the stubs from 1 to l n in some arbitrary order. Then, at each step, two 
stubs which are not already paired are chosen uniformly at random among all the unpaired or free stubs 
and are paired to form a single edge in the graph. These stubs are no longer free and removed from the 
list of free stubs. We continue with this procedure of choosing and pairing two stubs until all the stubs 
are paired. Observe that the order in which we choose the stubs does not matter. Although self-loops 
may occur, these become rare as n — ► oo (see e.g. [8] or [28] for more precise results in this direction). 

Above, we have described the construction of the CM when the degree sequence is given. Here we 
shall specify how we construct the actual degree sequence d, which shall be random. In general, we 
shall let a capital letter (such as Di) denote a random variable, while a lower case letter (such as di) 
denote a deterministic object. We shall assume that the random variables D\ , D2 > • • • D n are independent 
and identically distributed (i.i.d.) with a certain distribution function F. (When the sum of stubs 
L n = Y17=i Di is not even then we shall use the degree sequence -Di,Z?2, • • • ,D n , with D n replaced by 
D n + 1. This does not effect our calculations.) 

We shall assume that the degrees of all vertices are at least 2 and that the degree distribution F is 
regularly varying. More precisely, we assume 

F(D > 2) = 1, and 1 - F{x) = ar (T-1) L(a?), (2.2) 

with t > 2, and where x 1— > L{x) is a slowly varying function for x — > 00. In the case r > 3, we 
shall replace (|2.2j) by the less stringent condition (13. 2|) . Furthermore, each edge is given a random edge 
weight, which in this study will always be assumed to be independent and identically distributed (i.i.d.) 
exponential random variables with mean 1. Because in our setting the vertices are exchangeable, we let 
1 and 2 be the two random vertices picked uniformly at random in the network. 

As stated earlier, the parameter r is assumed to satisfy r > 2, so that the degree distribution has 
finite mean. In some cases, we shall distinguish between r > 3 and r € (2,3), in the former case, the 
variance of the degrees is finite, while in the latter, it is infinite. It follows from the condition Di > 2, 
almost surely, that the probability that the vertices 1 and 2 are connected converges to 1. 

Let / = {fj}j%i denote the probability mass function corresponding to the distribution function F, 
so that fj = F(j) — F(j — 1). Let }^==i denote the size-biased probability mass function corresponding 
to /, defined by 

9j = — , J > 0, (2.3) 

A* 

where \x is the expected size of the degree, i.e., 

00 

V = E[D] = Y,jfj- (2-4) 
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3 Results 



In this section, we state the main results for r > 2. We treat the case where r > 3 in Section [3. II and the 
case where r £ (2, 3) in Section I3T21 The case where r G [1, 2) is deferred to [Bj. 
Throughout the paper, we shall denote by 

(H n ,W n ), (3.1) 

the number of edges and total weight of the shortest-weight path between vertices 1 and 2 in the CM 
with i.i.d. degrees with distribution function F, where we condition the vertices 1 and 2 to be connected 
and we assume that each edge in the CM has an i.i.d. exponential weight with mean 1. 



3.1 Shortest-weight paths for r > 3 

In this section, we shall assume that the distribution function F of the degrees in the CM is non-degenerate 
and satisfies F(x) = 0, x < 2, so that the random variable D is non-degenerate and satisfies D > 2, a.s., 
and that there exist c > and r > 3 such that 

1 - F(x) < cx~ {T ~ l \ x > 0. (3.2) 

Also, we let 

_ E[D(D - 1)] 

V ~ E[D] ' [ ' 

As a consequence of the conditions we have that v > 1. The condition v > 1 is equivalent to the existence 
of a giant component in the CM, the size of which is proportional to n (see e.g. [24 ^ I3 H 132]. for the most 
recent and general result, see |29j). Moreover, the proportionality constant is the survival probability 
of the branching process with offspring distribution {gj}j>i- As a consequence of the conditions on the 
distribution function F, in our case, the survival probability equals 1, so that for n — > oo the graph 
becomes asymptotically connected in the sense that the giant component has n(l — o(l)) vertices. Also, 

when (13. 2|) holds, we have that v < oo. Throughout the paper, we shall let —> denote convergence in 



distribution and — ► convergence in probability. 

Theorem 3.1 (Precise asymptotics for r > 3) Let the degree distribution F of the CM on n vertices 
be non- degenerate, satisfy F{x) = 0, x < 2, and satisfy \3. 2\) for some r > 3. Then, 

(a) the hopcount H n satisfies the CLT 

H n -a\ogn d 

z j *• z > \ 6A ) 

V a log n 

where Z has a standard normal distribution, and 

a = -^e(l,oo); (3.5) 
v — 1 

(b) there exists a random variable V such that 

W n - l ^-^V. (3.6) 
v — 1 

In Section [C] of the appendix, we shall identify the limiting random variable V as 

v = logV^i _ log^2 A log/x(t/- 1) , 3 _ 

V — 1 v — 1 v — 1 v — 1 ' 

where W\,Wi are two independent copies of the limiting random variable of a certain supercritical 
continuous-time branching process and A has a Gumbel distribution. 
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3.2 Analysis of shortest-weight paths for r € (2,3) 



In this section, we shall assume that (|2.2p holds for some r £ (2, 3) and some slowly varying function 
x i — ^ L(x). When this is the case, the variance of the degrees is infinite, while the mean degree is finite. As 
a result, we have that v in (|3.3p equals v = oo, so that the CM is always supercritical (see [25 | [29 l [3"Tl [32] ) . 
In fact, for r £ (2,3), we shall make a stronger assumption on F than (|2.2p . namely, that there exists a 
t £ (2, 3) and < c\ < C2 < oo such that, for all x > 0, 



Theorem 3.2 (Precise asymptotics for r £ (2,3)) Let the degree distribution F of the CM on n ver- 
tices be non- degenerate, satisfy F{x) = 0, x < 2, and satisfy 113. 8\) for some r £ (2,3). Then, 
(a) the hopcount H n satisfies the CLT 

H n -a\ogn & 

/ i * z > (A y J 

V« log n 

where Z has a standard normal distribution and where 

2(t-2) - , , 

« = ^ — r e M); (3.io) 

r — 1 

i/iere exists a limiting random variable V such that 

W n V. (3.11) 

In Section [6l we shall identify the limiting distribution V precisely as 

V = V\ + V 2 , (3.12) 

where V\,V2 are two independent copies of a random variable which is the explosion time of a certain 
infinite- mean continuous-time branching process. 



3.3 Discussion and related literature 

Motivation. The basic motivation of this work was to show that even though the underlying graph 
topology might imply that the distance between two vertices is very small, if there are edge weights 
representing capacities, say, then the hopcount could drastically increase. Of course, the assumption of 
i.i.d. edge weights is not very realistic, however, it allows us to almost completely analyze the minimum 
weight path. The assumption of exponentially distributed edge weights is probably not necessary [H [27] 
but helps in considerably simplifying the analysis. Interestingly, hopcounts which are close to normal with 
asymptotically equal means and variances are observed in Internet (see e.g., [36]). The results presented 
here might shed some light on the origin of this observation. 



Universality for first passage percolation on the CM. Comparing Theorem 13.11 and Theorem 13.21 
we see that a remarkably universal picture emerges. Indeed, the hopcount in both cases satisfies a CLT 
with equal mean and variance proportional to logn, and the proportionality constant a satisfies a £ (0, 1) 
for r £ (2, 3), while a > 1 for r > 3. We shall see that the proofs of Theorems 13.11 and 13.21 run, to a large 
extent, parallel, and we shall only need to distinguish when dealing with the related branching process 
problem to which the neighborhoods can be coupled. 
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The case r € [1, 2) and critical cases r = 2 and r = 3. In [6], we study first passage percolation on 
the CM when r G [1,2), i.e., the degrees have infinite mean. We show that a remarkably different picture 
emerges, in the sense that H n remains uniformly bounded and converges in distribution. This is due to 
the fact that we can think of the CM, when r G [1, 2), as a union of an (essentially) finite number of stars. 
Together with the results in Theorems I3.1H3.21 we see that only the critical cases r = 2 and r = 3 remain 
open. We conjecture that the CLT, with asymptotically equal means and variances, remains valid when 
r = 3, but that the proportionality constant a can take any value in [1, oo), depending on, for example, 
whether v in (|3,3p is finite or not. What happens for r = 2 is less clear to us. 

Graph distances in the CM. Expanding neighborhood techniques for random graphs have been used 
extensively to explore shortest path structures and other properties of locally tree-like graphs. See the 
closely related papers [HI [231 EH1 E3] , where an extensive study of the CM has been carried out. Relevant 
to our context is [251 Corollary 1.4(i)], where it has been shown that when 2 < r < 3, the graph distance 
H n between two typical vertices, which are conditioned to be connected, satisfies the asymptotics 

-, (3.13) 



log log n | log (r — 2) | ' 

as n — > oo, and furthermore that the fluctuations of H n remain uniformly bounded as n — > oo. For r > 3, 
it is shown in [24} Corollary 1.3(i)] that 

-*+ (3.14) 
log n log v 

again with bounded fluctuations. Comparing these results with Theorems I3.1H3.2[ we see the drastic 
effect that the addition of edge weights has on the geometry of the graph. 



The degree structure. In this paper, as in [191 1231 US E3] , we assume that the degrees are i.i.d. with 
a certain degree distribution function F. In the literature, also the setting where the degrees {c£j}™ =1 are 
deterministic and converge in an appropriate sense to an asymptotic degree distribution is studied (see 
e-g-, [HI [201 [291 [311 [32] ) . We expect that our results can be adapted to this situation. Also, we assume 
that the degrees are at least 2 a.s., which ensures that two uniform vertices lie, with high probability 
(whp) in the giant component. We have chosen for this setting to keep the proofs as simple as possible, 
and we conjecture that Theorems 13. lHlOl when instead we condition the vertices 1 and 2 to be connected, 
remain true verbatim in the more general case of the supercritical CM. 



Annealed vs. quenched asymptotics. The problem studied in this paper, first passage percolation 
on a random graph, fits in the more general framework of stochastic processes in random environments, 
such as random walk in random environment. In such problems, there are two interesting settings, namely, 
when we study results when averaging out over the environment and when we freeze the environment 
(the so-called annealed and quenched asymptotics). In this paper, we study the annealed setting, and it 
would be of interest to extend our results to the quenched setting, i.e., study the first-passage percolation 
problem conditionally on the random graph. We expect the results to change in this case, primarily due 
to the fact that we know the exact neighborhood of each point. However, when we consider the shortest- 
weight problem between two uniform vertices, we conjecture Theorems 13. 1H1T21 to remain valid verbatim, 
due to the fact that the neighborhoods of uniform vertices converge to the same limit as in the annealed 
setting (see e.g., [4"1 124"]). 



First passage percolation on the Erdos-Renyi random graph. We recall that the Erdos-Renyi 
random graph G(n,p) is obtained by taking the vertex set [n] = {1, . . . ,n} and letting each edge ij be 
present, independently of all other edges, with probability p. The study closest in spirit to our study is [5], 
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where similar ideas were explored for dense Erdos-Renyi random graphs. The Erdos-Renyi random graph 
G(n,p) can be viewed as a close brother of the CM, with Poisson degrees, hence with r = oo. Consider 
the case where p = fi/n and fj, > 1. In a future paper we plan to show, parallel to the above analysis, that 
H n satisfies a CLT with asymptotically equal mean and variance given by — ^j- log n. This connects up 
nicely with [5], where related results were shown for fi = \i n — > oo, and H n /\ogn was proved to converge 
to 1 in probability. See also [22] where related statements were proved under stronger assumptions on 
/i n . Interestingly, in a recent paper, Ding et al [15] use first passage percolation to study the diameter of 
the largest component of the Erdos-Renyi random graph with edge probability p = (1 + e)/n for e = o(l) 
and e 3 n — > oo. 

The weight distribution. It would be of interest to study the effect of weights even further, for 
example, by studying the case where the weights are i.i.d. random variables with distribution equal to 
E s , where E is an exponential random variable with mean 1 and s € [0, oo). The case s = corresponds 
to the graph distance H n as studied in [191 12H EH] , while the case s = 1 corresponds to the case with i.i.d. 
exponential weights as studied here. Even the problem on the complete graph seems to be open in this 
case, and we intend to return to this problem in a future paper. We conjecture that the CLT remains 
valid for first passage perolation on the CM when the weights are given by independent copies of E s , with 
asymptotic mean and variance proportional to logn, but, when s / 1, we predict that the asymptotic 
means and variances have different constants. 

We became interested in random graphs with edge weights from [9] where, via empirical simulations, a 
wide variety of behavior was predicted for the shortest- weight paths in various random graph models. The 
setup that we analyze is the weak disorder case. In [9], also a number of interesting conjectures regarding 
the strong disorder case were made, which would correspond to analyzing the minimal spanning tree of 
these random graph models, and which is a highly interesting problem. 

Related literature on shortest-weight problems. First passage percolation, especially on the in- 
teger lattice, has been extensively studied in the last fifty years, see e.g. [18] and the more recent survey 
|26] . In these papers, of course, the emphasis is completely different, in the sense that geometry plays an 
intrinsic role and often the goal of the study is to show that there is a limiting "shape" to first passage 
percolation from the origin. 

Janson [27] studies first passage percolation on the complete graph, with exponential weights. His 
main results are 

Wn 3) P 1 m&XjKnWn^ p _ maXj j<n Wn 3) P „ ,ntr\ 

> 1, — >2, f*=— >3. (3.15) 

log n/n log n/n log n/n 

where Wn 3) denotes the weight of the shortest path between the vertices i and j. Recently the authors 
of PQ, showed in the same set-up, that maxj.j< n Hn J) / log n — > a*, where a* « 3.5911 is the unique 
solution of the equation xlogx — x = 1. It would be of interest to investigate such questions in the CM 
with exponential weights. 

The fundamental difference of first passage percolation on the integer lattice, or even on the complete 
graph, is that in our case the underlying graph is random as well, and we are lead to the delicate relation 
between the randomness of the graph together with that of the stochastic process, in this case first 
passage percolation, living on it. Finally, for a slightly different perspective to shortest weight problems, 
see |37] where relations between the random assignment problem and the shortest-weight problem with 
exponential edge weights on the complete graph are explored. 

4 Overview of the proof and organization of the paper 

The key idea of the proof is to first grow the shortest-weight graph (SWG) from vertex 1, until it reaches 
an appropriate size. After this, we grow the SWG from vertex 2 until it connects up with the SWG from 
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vertex 1. The size to which we let the SWG from 1 grow shall be the same as the typical size at which 
the connection between the SWG from vertices 1 and 2 shall be made. However, the connection time at 
which the SWG from vertex 2 connects to the SWG from vertex 1 is random. 

More precisely, we define the SWG from vertex 1, denoted by SWG (1) , recursively. The growth of 
the SWG from vertex 2, which is denoted by SWG (2) , is similar. We start with vertex 1 by defining 
SWGg 1 ' = {1}. Then we add the edge and vertex with minimal edge weight connecting vertex 1 to one 
of its neighbors (or itself when the minimal edge is a self-loop). This defines SWG^ 1 '. We obtain SWG^ 
from SWG^_ 1 by adding the edge and end vertex connected to the SWG m _i with minimal edge weight. 
We informally let SWG^' denote the SWG from vertex i G {1,2} when m edges (and vertices) have been 
added to it. This definition is informal, as we shall need to deal with self-loops and cycles in a proper way. 
How we do this is explained in more detail in Section 14.21 As mentioned before, we first grow SWG^ 
to a size a n , which is to be chosen appropriately. After this, we grow SWG^, and we stop as soon as a 
vertex of SWGi x) appears in {SWG^}^_q, as then the shortest- weight path between vertices 1 and 2 has 
been found. Indeed, if on the contrary, the shortest weight path between vertex 1 and vertex 2 contains 
an edge not contained in the union of the two SWGs when they meet, then necessarily this edge would 
have been chosen in one of the two SWGs at an earlier stage, since at some earlier stage this edge must 
have been incident to one of the SWGs and had the minimal weight of all edges incident to that SWG. 
In Sections 14.21 and 14.31 we shall make these definitions precise. 

Denote this first common vertex by A, and let Gi be the distance between vertex i and A, i.e., the 
number of edges on the minimum weight path from i to A. Then, we have that 

H n = G 1 + G 2 , (4.1) 

while, denoting by Tj the weight of the shortest-weight paths from i to A, we have 

W n = Ti + T 2 . (4.2) 

Thus, to understand the random variables H n and W n , it is paramount to understand the random variables 
Tj and Gi, for i = 1,2. 

Since, for n — > oo, the topologies of the neighborhoods of vertices 1 and 2 are close to being indepen- 
dent, it seems likely that G± and G2, as well as T\ and T 2 are close to independent. Since, further, the 
CM is locally tree-like, we are lead to the study of the problem on a tree. 

With the above in mind, the paper is organized as follows: 

• In Section 14.11 we study the flow on a tree. More precisely, in Proposition 14.31 we describe the 
asymptotic distribution of the length and weight of the shortest-weight path between the root and 
the m th added vertex in a branching process with i.i.d. degrees with offspring distribution g in (|2,3D . 
Clearly, the CM has cycles and self-loops, and, thus, sometimes deviates from the tree description. 

• In Section 14.21 we reformulate the problem of the growth of the SWG from a fixed vertex as a 
problem of the SWG on a tree, where we find a way to deal with cycles by a coupling argument, 
so that the arguments in Section 14.11 apply quite literally. In Proposition 14.61 we describe the 
asymptotic distribution of the length and weight of the shortest- weight path between a fixed vertex 
and the m th added vertex in the SWG from the CM. However, observe that the random variables 
Gi described above are the generation of a vertex at the time at which the two SWGs collide, and 
this time is a random variable. 

• In Section f4.3|, we extend the discussion to this setting, and formulate the necessary ingredients for 
the collision time, i.e., the time at which the connecting edge appears, in Proposition 14.41 In Section 
14.51 we complete the outline. 

• The proofs of the key propositions are deferred to Sections [5H3 

• Technical results needed in the proofs in Sections [SHU for example on the topology of the CM, are 
deferred to the appendix. 
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4.1 Description of the flow clusters in trees 

We shall now describe the construction of the SWG in the context of trees. In particular, below, we shall 
deal with a flow on a branching process tree, where the offspring is deterministic. 

Deterministic construction: Suppose we have positive (non-random) integers di,d,2, ■ ■ ■■ Consider the 
following construction of a branching process in discrete time: 

Construction 4.1 (Flow from root of tree) The shortest-weight graph on a tree with degrees {di}^ =1 
is obtained as follows: 

1. At time 0, start with one alive vertex (the initial ancestor); 

2. At each time step i, pick one of the alive vertices at random, this vertex dies giving birth to di 
children. 

In the above construction, the number of offspring di is fixed once and for all. For a branching process 
tree, the variables di are i.i.d. random variables. This case shall be investigated later on, but the case of 
deterministic degrees is more general, and shall be important for us to be able to deal with the CM. 

Consider a continuous-time branching process defined as follows: 

1. Start with the root which dies immediately giving rise to d\ alive offspring; 

2. Each alive offspring lives for Exp(l) amount of time, independent of all other randomness involved; 

3. When the m th vertex dies it leaves behind d m alive offspring. 

The split-times (or death-times) of this branching process are denoted by Tj, i > 1. Note that 
the Construction 14.11 is equivalent to this continuous branching process, observed at the discrete times 
Tj , i > 1 . The fact that the chosen alive vertex is chosen at random follows from the memoryless property 
of the exponential random variables that compete to become the minimal one.) We quote a fundamental 
result from [10]. In its statement, we let 

«i = di + •■• + *-(*- 1)0 (4.3) 

Proposition 4.2 (Shortest-weight paths on a tree) Pick an alive vertex at time m > 1 uniformly 
at random among all vertices alive at this time. Then, 

(a) the generation of the m th chosen vertex is equal in distribution to 

m 

G m = J2^ ( 4 - 4 ) 

i=l 

where {Ii}f2 zl are independent Bernoulli random variables with 

P(Ji = 1) = di/si, (4.5) 

(b) the weight of the shortest-weight path between the root of the tree and the vertex chosen in the m th 
step is equal in distribution to 

m 

T m = Y,E l /s u (4.6) 

i=l 

where {Ei} ( ^ =1 are i.i.d. exponential random variables with mean 1. 

A new probabilistic proof is added, since there is some confusion between the definition Sj given here, and the definition 
of Si given in [10] below Equation (3.1)]. More precisely, in [10], Sj is defined as Sj = d\ + . . . + di — i, which is our Sj — 1. 
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Proof. We shall prove part (a) by induction. The statement is trivial for m = 1. We next assume that 
(|4.4p holds for m, where are independent Bernoulli random variables satisfying (|4.5p . Let G m +i 

denote the generation of the randomly chosen vertex at time m + 1, and consider the event {G m+ i = 
fc}, 1 < < m. If randomly choosing one of the alive vertices at time m + 1 results in one of the d m +i 
newly added vertices, then, in order to obtain generation k, the previous uniform choice, i.e., the choice 
of the vertex which was the last one to die, must have been a vertex from generation k — 1. On the other 
hand, if a uniform pick is conditioned on not taking one of the d m+ \ newly added vertices, then this 
choice must have been a uniform vertex from generation k. Hence, we obtain, for 1 < k < m, 

F(G m+1 = k) = ^P(G m = k - 1) + (l - ^±i)p(G m = k). (4.7) 

The proof of part (a) is now immediate from the induction hypothesis. The proof of part(b) is as follows. 
The minimum of Sj independent exp(l) random variables has an exponential distribution with parameter 
Si, and is hence equal in distribution to Ei/si. We further use the memoryless property of the exponential 
distribution which guarantees that at each of the discrete time steps the remaining lifetimes (or weights) 
of the alive vertices are independent exponential variables with mean 1, independent of what happened 
previously. ■ 

We note that, while Proposition 14.21 was applied in [10|, Theorem 3.1] only in the case where the 
degrees are i.i.d., in fact, the results hold more generally for every tree (see e.g., [TUJ Equation (3.1)], and 
the above proof). This extension shall prove to be vital in our analysis. 

We next intuitively relate the above result to our setting. Start from vertex 1, and iteratively choose 
the edge with minimal additional weight attached to the SWG so far. As mentioned before, because of 
the properties of the exponential distribution, the edge with minimal additional weight can be considered 
to be picked uniformly at random from all edges attached to the SWG at that moment. With high 
probability, this edge is connected to a vertex which is not in the SWG. Let Bi denote the forward degree 
(i.e., the degree minus 1) of the vertex to which the 7 th edge is connected. By the results in |24} I25j . 
{Bi}i>2 are close to being i.i.d., and have distribution given by (|2.3|) . Therefore, we are lead to studying 
random variables of the form (j4.4H — (j4.5p . where {-Bi}^ are i.i.d. random variables. Thus, this means 
that we study the unconditional law of G m in (|4.4p . in the setting where the vector {di}fl 1 is replaced 
by an i.i.d. sequence of random variables We shall first state a CLT for G m and a limit result 

for T m in this setting. In its statement, we shall also make use of the random variable T m , which is the 
weight of the shortest weight path between the root and the parent of the 777 th individual in the branching 
process. Thus, in particular, T m < T m , and T m — T m is the time between the addition of the parent of 
the m th individual and the m th individual itself. 

Proposition 4.3 (Asymptotics for shortest- weight paths on trees) Let {Bi}^ 1 be an i.i.d. se- 
quence of non- degenerate, positive integer valued, random variables, satisfying 

F(Bi > k) = k 2 ~ T L(k), r > 2, 

for some slowly varying function k t— > L(k). Denote by v = E[-Bi], for r > 3, whereas v = oo, for 
t E (2,3). Then, 

(a) for G m given in fr4-4\ )~ [4-5\ )> w ith di replaced by Bi, there exists a (3>1 such that, as m—> oo, 

G m -/?log777 d 7 , „ Kr( . ,. Q s 
> Z, where z~A(0, 1), (4.8) 

V p log 777 

a standard normal variable, and where (3 = v j{y — 1) for t > 3, while (3 = 1 for r E (2, 3); 

(b) for T m given in f^.g} ), there exists random variables X,X such that 

T m - 7 log 777 — > X, T m - 7 log 777 — > X, (4.9) 

where 7 = l/[y — 1) when t > 3, while 7 = when r E (2, 3). In the latter case X = X. 
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Proposition 14.31 is proved in [TO] Theorem 3.1] when Var(-Bj) < oo, which holds when r > 4, but not 
when r S (2, 4). We shall prove Proposition 14.31 in Section [5] below. There, we shall also see that the result 
persists under weaker assumptions than {Bi}^ zl being i.i.d., for example, when {Bi}^ 1 are exchangeable 
non-negative integer valued random variables satisfying certain conditions. Such extensions shall prove 
to be useful when dealing with the actual (forward) degrees in the CM. 

4.2 A comparison of the flow on the CM and the flow on the tree 

Proposition 14.31 gives a CLT for the generation when considering a flow on a tree. In this section, we shall 
relate the problem of the flow on the CM to the flow on a tree. The key feature of this construction is that 
we shall simultaneously grow the graph topology neighborhood of a vertex, as well as the shortest-weight 
graph from it. This will be achieved by combining the construction of the CM as described in Section 
[2] with the fact that, from a given set of vertices and edges, if we grow the shortest-weight graph, each 
potential edge is equally likely to be the minimal one. 

In the problem of finding the shortest weight path between two vertices 1 and 2, we shall grow two 
SWGs simultaneously from the two vertices 1 and 2, until they meet. This is the problem that we actually 
need to resolve in order to prove our main results in Theorems I3.1H3.21 The extension to the growth of 
two SWGs is treated in Section 14.31 below. 

The main difference between the flow on a graph and on a tree is that on the tree there are no cycles, 
while on a graph there are. Thus, we shall adapt the growth of the SWG for the CM in such a way that 
we obtain a tree (so that the results from Section 14.11 apply) , while we can still retrieve all information 
about shortest-weight paths from the constructed graph. This will be achieved by introducing the notion 
of artificial vertices and stubs. We start by introducing some notation. 

We denote by {SWG m } m >o the SWG process from vertex 1. We construct this process recursively. 
We let SWGo consist only of the alive vertex 1, and we let So = 1. We next let SWGi consist of the D\ 
allowed stubs and of the explored vertex 1 , and we let S\ = So + D\ — 1 = D\ denote the number of allowed 
stubs. In the sequel of the construction, the allowed stubs correspond to vertices in the shortest-weight 
problem on the tree in Section B~T1 This constructs SWGi. Next, we describe how to construct SWG m 
from SWG„,_i. For this construction, we shall have to deal with several types of stubs: 

(a) the allowed stubs at time m, denoted by AS m , are the stubs that are incident to vertices of the SWG m , 
and that have not yet been paired to form an edge; S m = |AS m | denotes their number; 

(b) the free stubs at time m, denoted by FS m , are those stubs of the L n total stubs which have not yet 
been paired in the construction of the CM up to and including time m; 

(c) the artificial stubs at time m, denoted by Art m , are the artificial stubs created by breaking ties, as 
described in more detail below. 

We note that Art m C AS m , indeed, AS m \FS m = Art m . Then, we can construct SWG m from SWG m _i 
as follows. We choose one of the S rn -\ allowed stubs uniformly at random, and then, if the stub is not 
artificial, pair it uniformly at random to a free stub unequal to itself. Below, we shall consistently call 
these two stubs the chosen stub and the paired stub, respectively. There are 3 possibilities, depending on 
what kind of stub we choose and what kind of stub it is paired to: 

Construction 4.4 (The evolution of SWG for CM as SWG on a tree) 

(1) The chosen stub is real, i.e., not artificial, and the paired stub is not one of the allowed stubs. In 
this case, which shall be most likely at the start of the growth procedure of the SWG, the paired stub is 
incident to a vertex outside SWG m _i, we denote by B m the forward degree of the vertex incident to the 
paired stub (i.e, its degree minus 1), and we define S m = S m -i + B m — 1. Then, we remove the paired 
and the chosen stub from AS m _i and add the B m stubs incident to the vertex incident to the paired stub 
to AS m _i to obtain AS m , we remove the chosen and the paired stubs from FS m _i to obtain FS m , and 
Art m = Art m _i 7 ' 

(2) The chosen stub is real and the paired stub is an allowed stub. In this case, the paired stub is incident 
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to a vertex in SWG m _i and we have created a cycle. In this case, we create an artificial stub replacing 
the paired stub, and denote B m = 0. Then, we let S m = S m -\ — 1, remove both the chosen and paired 
stubs from AS m _i and add the artificial stub to obtain AS m , and remove the chosen and paired stub from 
FS m _i to obtain FS m , while Art m is Art m _i together with the newly created artificial stub. In SWG m , 
we also add an artificial edge to an artificial vertex in the place where the chosen stub was, the forward 
degree of the artificial vertex being 0. This is done because a vertex is added each time in the construction 
on a tree. 

(3) The chosen stub is artificial. In this case, we let B m = ; S m = S m -i — 1, and remove the chosen 
stub from AS m _i and Art m _i to obtain AS m and Art m , while FS m = FS m _i. 

In the construction in Construction 14.44 we always work on a tree since we replace an edge which 
creates a cycle, by one artificial stub, to replace the paired stub, and an artificial edge plus an artificial 
vertex in the SWG m with degree 0, to replace the chosen stub. Note that the number of allowed edges 
at time m satisfies S m = 5 m _i + B m — 1, where B\ = D\ and, for m > 2, in cases (2) and (3), B m = 0, 
while in case (1) (which we expect to occur in most cases), the distribution of B m is equal to the forward 
degree of a vertex incident to a uniformly chosen stub. Here, the choice of stubs is without replacement. 

The reason for replacing cycles as described above is that we wish to represent the SWG problem as 
a problem on a tree, as we now will explain informally. On a tree with degrees {cZj}?^, as in Section 
14.11 we have that the remaining degree of vertex i at time m is precisely equal to a\ minus the number 
of neighbors that are among the m vertices with minimal shortest-weight paths from the root. For first 
passage percolation on a graph with cycles, a cycle does not only remove one of the edges of the vertex 
incident to it (as on the tree), but also one edge of the vertex at the other end of the cycle. Thus, this is 
a different problem, and the results from Section [4.11 do not apply literally. By adding the artificial stub, 
edge and vertex, we artificially keep the degree of the receiving vertex the same, so that we do have the 
same situation as on a tree, and we can use the results in Section |4"TT1 However, we do need to investigate 
the relation between the problem with the artificial stubs and the original SWG problem on the CM. 
That is the content of the next proposition. 

In its statement, we shall define the m th closest vertex to vertex 1 in the CM, with i.i.d. exponential 
weights, as the unique vertex of which the minimal weight path is the m th smallest among all n — 1 
vertices. Further, at each time m, we denote by artificial vertices those vertices which are artificially 
created, and we call the other vertices real vertices. Then, we let the random time R m be the first time 
j that SWGj consists of m + 1 real vertices, i.e., 

R m = min {j >0 : SWGy contains m + 1 real vertices}. (4-10) 

The +1 originates from the fact that at time m = 0, SWGo consists if 1 real vertex, namely, the vertex 
from which we construct the SWG. Thus, in the above set up, we have that R m = m precisely when no 
cycle has been created in the construction up to time m. Then, our main coupling result is as follows: 

Proposition 4.5 (Coupling shortest-weight graphs on a tree and CM) Jointly for all m > I, 
the set of real vertices in SWG# m is equal in distribution to the set of i th closest vertices to vertex 
1, for i = 1, . . . , m. Consequently, 

(a) the generation of the m th closest vertex to vertex 1 has distribution G^ m , where G m is defined in 
j^.^| )- |775| ) with d\ = D\ and d{ = Bi, i>2, as described in Construction \4-4\ 

(b) the weight of the shortest weight path to the m th closest vertex to vertex 1 has distribution Tfj m , where 
T m is defined in ft4-6\ ) with d\ = D\ and di = Bi, i > 2, as described in Construction \4-4\ 

We shall make use of the nice property that the sequence {Bji m }^ l= 2, which consists of the forward 
degrees of chosen stubs that are paired to stubs which are not in the SWG, is, for the CM, an exchangeable 
sequence of random variables (see Lemma 16.11 below). This is due to the fact that a free stub is chosen 
uniformly at random, and the order of the choices does not matter. This exchangeability shall prove to 
be useful in order to investigate shortest-weight paths in the CM. We now prove Proposition 14.51 
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Proof of Proposition 14.51 In growing the SWG, we give exponential weights to the set {AS m }m>i- 
After pairing, we identify the exponential weight of the chosen stub to the exponential weight of the edge 
which it is part of. We note that by the memoryless property of the exponential random variable, each 
stub is chosen uniformly at random from all the allowed stubs incident to the SWG at the given time. 
Further, by the construction of the CM in Section [21 this stub is paired uniformly at random to one 
of the available free stubs. Thus, the growth rules of the SWG in Construction 14.41 equal those in the 
above description of {SWG m }^ =0 , unless when a cycle is closed and an artificial stub, edge and vertex are 
created. In this case, the artificial stub, edge and vertex might influence the law of the SWG. However, we 
note that the artificial vertices are not being counted in the set of real vertices, and since artificial vertices 
have forward degree 0, they will not be a part of any shortest path to a real vertex. Thus, the artificial 
vertex at the end of the artificial edge does not affect the law of the SWG. Artificial stubs that are created 
to replace paired stubs when a cycle is formed, and which are not yet removed at time m, will be called 
dangling ends. Now, if we only consider real vertices, then the distribution of weights and lengths of the 
shortest-weight paths between the starting points and those real vertices are identical. Indeed, we can 
decorate any graph with as many dangling ends as we like without changing the shortest-weight paths to 
real vertices in the graph. ■ 

Now that the flow problem on the CM has been translated into a flow problem on a related tree of 
which we have explicitly described its distribution, we may make use of Proposition 14.2^ which shall allow 
us to extend Proposition 14.31 to the setting of the CM. Note that, among others due to the fact that when 
we draw an artificial stub, the degrees are not i.i.d. (and not even exchangeable since the probability of 
drawing an artificial stub is likely to increase in time), we need to extend Proposition 14.31 to a setting 
where the degrees are weakly dependent. In the statement of the result, we recall that G m is the height 
of the m th added vertex in the tree problem above. In the statement below, we write 

fare (2,3); 
In 1 / 2 for r > 3, 

where, for a, b G M, we write a A b = min{a, b}. 

Before we formulate the CLT for the hopcount of the shortest-weight graph in the CM, we repeat 
once more the setup of the random variables involved. Let So = 1, Si = Di, and for j > 2, 



S j =D 1 + J2(B i -l), (4.12) 



i=2 



where, in case the chosen stub is real, i.e., not artificial, and the paired stub is not one of the allowed 
stubs, Bi equals the forward degree of the vertex incident to the i th paired stub, whereas Bi = otherwise. 
Finally, we recall that, conditionally on D\,B2, -B3, . . . , B m , 

m 

G m = ^2li, where P(Ji = 1) = 1, F(Ij = 1) = Bj/Sj, 2 < j < m. (4.13) 

i=i 

Proposition 4.6 (Asymptotics for shortest- weight paths in the CM) (a) Let the law of G m be 

given in \4.13\) . Then, with (3 > 1 as in Proposition \4-3[ and as long as m < m n , for any m n such that 
log (m n /a n ) = o(Vlogn), 

G <»-J losm jL+Z, where Z~Af(0,l). (4.14) 
V p log m 

(b ) Let the law of T m be given in (|4.6p , with Sj replaced by Si given by (|4.12p and let 7 be as in Proposition 
\4-3[ Then, there exists a random variable X such that 

T m - 7 logm^X (4.15) 
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The same results apply to Gn m andTn m , i.e., in the statements (a) and (b) the integer m can be replaced 
by R m , as long as m < m n . 

Proposition 14.61 implies that the result of Proposition 14.31 remains true for the CM, whenever m is not 
too large. Important for the proof of Proposition 14.61 is the coupling to a tree problem in Proposition 14.51 
Proposition 14.61 shall be proved in Section [6j An important ingredient in the proof will be the comparison 
of the variables {Bm}m=2i f° r an appropriately chosen m n , to an i.i.d. sequence. Results in this direction 
have been proved in |24^ I25j. and we shall combine these to the following statement: 

Proposition 4.7 (Coupling the forward degrees to an independent sequence) In the CM with 
t > 2, there exists a p > such that the random vector {B m }m=2 can be coupled to an independent 
sequence of random variables {Bm d) }m=2 w ith probability mass function g in \2.3\) in such a way that 
{5 m }t 2 = {Btt d) }% =2 whp. 

In Proposition 14.71 in fact, we can take {B m }m=2 t° be the forward degree of the vertex to which any 
collection of n p distinct stubs has been connected. 

4.3 Flow clusters started from two vertices 

To compute the hopcount, we first grow the SWG from vertex 1 until time a n , followed by the growth 
of the SWG from vertex 2 until the two SWGs meet, as we now explain in more detail. Denote by 
{SWG$}~ =0 the SWG from the vertex i G {1, 2}, and, for m > 0, let 

SWG^ 2 ' = SWG« U SWG&\ (4.16) 

the union of the SWGs of vertex 1 and 2. We shall only consider values of m where SWG^ and SWG^ 
are disjoint, i.e., they do not contain any common (real) vertices. We shall discuss the moment when 
they connect in Section f4.4l below. 

We recall the notation in Section H~2l and, for i G {1,2}, denote by AS^' and Art^' the number of 
allowed and artificial stubs in SWG^. We let the set of free stubs FS m consist of those stubs which 
have not yet been paired in SWG^' 2 ' in (|4.16p . Apart from that, the evolution of SWG^, following the 
evolution of SWG^, is identical as in Construction [44l We denote by Sm = |AS^| the number of allowed 
stubs in SWG^ for i G {1, 2}. We define B$ accordingly. 

The above description shows how we can grow the SWG from vertex 1 followed by the one of vertex 
2. In order to state an adaptation of Proposition 14.51 to the setting where the SWGs of vertex 1 is first 
grown to size a n , followed by the growth of the SWG from vertex 2 until the connecting edge appears, 
we let the random time Rm be the first time I such that SWG[ l) consists of m + 1 real vertices. Then, 
our main coupling result for two simultaneous SWGs is as follows: 

Proposition 4.8 (Coupling SWGs on two trees and CM from two vertices) Jointly for m > 0, 
as long as the sets of real vertices in (SWG^, SWG^) are disjoint, these sets are equal in distribution 
to the sets ofjf 1 , respectively j^ 1 , closest vertices to vertex 1 and 2, respectively, for ji = 1, . . . ,Ral and 
j2 = 1, . . . , Rm , respectively. 

4.4 The connecting edge 

As described above, we grow the two SWGs until the first stub with minimal weight incident to SWG^ 
is paired to a stub incident to SWG^. We call the created edge linking the two SWGs the connecting 
edge. More precisely, let 

C n = min{m > : SWG™ n SWG« / 0}, (4.17) 

be the first time that SWG' 15 and SWG'!' share a vertex. When m = 0, this means that 2 G SWG' 1 ' 
(which we shall show happens with small probability), while when m > 1, this means that the m th -stub 
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of SWG (2) which is chosen and then paired, is paired to a stub from SWG^. The path found actually 
is the shortest-weight path between vertices 1 and 2, since SWG^ and SWG^ precisely consists of the 
closest real vertices to the root i, for i = 1,2, respectively. 

We now study the probabilistic properties of the connecting edge. Let the edge e = st be incident to 
SWGi 1 ', and s and t denote its two stubs. Let the vertex incident to s be i, and the vertex incident to t 
be it- Assume that i s £ SWG^, so that, by construction, i t SWG^. Then, conditionally on SWG^ 
and {T-^}®^, the weight of e is at least Taj — where W> is the weight of the shortest path from 1 
to i s . By the memoryless property of the exponential distribution, therefore, the weight on edge e equals 
Ta„ ~ W£ +E e , where the collection (E e ), for all e incident to SWG^ are i.i.d. Exp(l) random variables. 
Alternatively, we can redistribute the weight by saying that the stub t has weight E e , and the stub s has 
weight Taj — Wj (1> - Further, in the growth of (SWG^') m >o, we can also think of the exponential weights 
of the edges incident to SWG^' being positioned on the stubs incident to SWG^. Hence, there is no 
distinction between the stubs that are part of edges connecting SWG^ and SWG^ and the stubs that 
are part of edges incident to SWG^, but not to SWG^. Therefore, in the growth of (SWG^) m >o, we 
can think of the minimal weight stub incident to SWG^ being chosen uniformly at random, and then a 
uniform free stub is chosen to pair it with. As a result, the distribution of the stubs chosen at the time 
of connection is equal to any of the other (real) stubs chosen along the way. This is a crucial ingredient 
to prove the scaling of the shortest-weight path between vertices 1 and 2. 

For i G {1,2}, let denote the length of the shortest- weight path between vertex i and the common 
vertex in SWG^ and SWG^ , so that 

H n = H^+H%\ (4.18) 
Because of the fact that at time C n we have found the shortest-weight path, we have that 

M^) = (G£ +1 -l,Gg>), (4.19) 

where {G m ] }^ =1 and {G m } }^ = i copies of the process in (|4.4p. which are conditioned on drawing a real 
stub. Indeed, at the time of the connecting edge, a uniform (real) stub of SWG^ is drawn, and it is 
paired to a uniform (real) stub of SWG^j . The number of hops in SWG^ to the end of the attached edge 
is therefore equal in distribution to conditioned on drawing a real stub. The —1 in (|4.19p arises 

since the connecting edge is counted twice in G a 1 , 1 + G^i- The processes {Gm}™ =1 and {Gm'}™ =1 are 
conditionally independent given the realizations of {Bm}JJ, =2 . 

Further, because of the way the weight of the potential connecting edges has been distributed over 
the two stubs out of which the connecting edge is comprised, we have that 

W n = T£+I™, (4.20) 

where {Tm}^=i and {Tm}m=i are two copies of the process {T m }^ =1 in (|4.6p . again conditioned on 
drawing a real stub. Indeed, to see (|4.20p . we note that the weight of the connecting edge is equal to the 
sum of weights of its two stubs. Therefore, the weight of the shortest weight path is equal to the sum of 
the weight within SWG^ , which is equal to Taj , and the weight within SWG^f , which is equal to T^ 2) . 

In the distributions in (|4.19|) and (|4.20p above, we always condition on drawing a real stub. Since we 
shall show that this occurs whp, this conditioning plays a minor role. 

We shall now intuitively explain why the leading order asymptotics of C n is given by a n , where a n 
is defined in (14. lip . For this, we must know how many allowed stubs there are, i.e., we must determine 
how many stubs there are incident to the union of the two SWGs at any time. Recall that Sm denotes 
the number of allowed stubs in the SWG from vertex i at time m. The total number of allowed stubs 
incident to SWG^ is SaJ , while the number incident to SWG^' is equal to Sm , and where 

m 

S%=D i + J2(Bt i) -l). (4.21) 

1=2 
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We also write Art m = Art^j U Art$ . 

Conditionally on SWG£ and {{Sf, Artf ) )}fj x 1 and L n , and assuming that |Art m |, m and S m satisfy 
appropriate bounds, we obtain 

P(C n = m|C n > m- 1) Rs (4.22) 
When r G (2,3) and fl£BJ) holds, then Sf 7/ 1/(t_2) 

can be expected to converge in distribution to a 
stable random variable with parameter r — 2, while, for r > 3, f /Z converges in probability to v — 1, 
where is defined in (13. 3D . We can combine these two statements by saying that gW// 1 /^ 3 - 2 ) converges 
in distribution. Note that the typical size a n of C n is such that, uniformly in n, P(C n G [a n , 2a n ]) remains 
in (e, 1 — e), for some e G (0, ^), which is the case when 

2a„ 

P(C n G [a n , 2a n ]) = P(C n = m|C n > m - l)P(C n > m - 1) G (e, 1 - e), (4.23) 

m=a n 

uniformly as n — > oo. By the above discussion, and for a n < m < 2a n , we have P(C n = m\C n > m — 1) = 

( m l/(rA3-2)/ n ) = e ^ l/(TA3-2)y n ^ > m _ 1) = @ ^ rj,^ wg arrive &t 

P(C„ G [an, 2a n ]) = e(a n ay( TA3 ~ 2 Vn), (4.24) 
which remains uniformly positive and bounded for a n defined in (|4.1ip . In turn, this suggests that 

C n /a n -U M, (4.25) 

for some limiting random variable M. 

We now discuss what happens when (|2.2j) holds for some r G (2,3), but (13. 8p fails. In this case, 
there exists a slowly varying function n \— ► £(n) such that /(£(l)l 1 ^ T ~ 2 ^) converges in distribution. 
Then, following the above argument shows that the right-hand side (r.h.s.) of (I4.24p is replaced by 
Q{a n a n ^ T 2 ^£(a n )/n), which remains uniformly positive and bounded for a n satisfying an ^ £{a n ) = 

n>- By Theorem 1.5.12], there exists a solution a n to the above equation which satisfies that it is 
regularly varying with exponent (r — 2)/(r — 1), so that 

a n = n^/^t{n), (4.26) 

for some slowly varying function n i— ► which depends only on the distribution function F. 

In the following proposition, we shall state the necessary result on C n that we shall need in the 
remainder of the proof. In its statement, we shall use the symbol o F (b n ) to denote a random variable X n 
which satisfies that X n /b n 0. 

Proposition 4.9 (The time to connection) As n — > oo, under the conditions of Theorems \3.2\ and 
\3.1\ respectively, and with a n as in ^.llty , 



log C n - loga n = o P (Vlogn). (4.27) 
Furthermore, for i G {1,2}, and with (3 > 1 as in Proposition ^. 3\ 

, Gi 1} , i - log a n G< 2) - (3 log a n . d 

( = — , C " - (Zi, Z 2 ), (4.28) 
V/Jlogan V/3!ogan 

where Z\,Z 2 are two independent standard normal random variables. Moreover, with 7 as in Proposition 



4-3\ there exist random variables X\ , X2 such that 

(T&> -7 log a n ,T$l -7 log a n ) (X 1 ,X 2 ). (4.29) 
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We note that the main result in (|4.28j) is not a simple consequence of (|4.27p and Proposition 14,61 
The reason is that C n is a random variable, which a priori depends on (G^ +1 ,Gm) for m > 0. Indeed, 
the connecting edge is formed out of two stubs which are not artificial, and thus the choice of stubs is 
not completely uniform. However, since there are only few artificial stubs, we can extend the proof of 
Proposition 14.61 to this case. Proposition 14.91 shall be proved in Section [71 



4.5 The completion of the proof 

By the analysis in Section 14.41 we know the distribution of the sizes of the SWGs at the time when the 
connecting edge appears. By Proposition 14.91 we know the number of edges and their weights used in the 
paths leading to the two vertices of the connecting edge, together with its fluctuations. In the final step, 
we need to combine these results by averaging both over the randomness of the time when the connecting 
edge appears (which is a random variable), as well as over the number of edges in the shortest weight 
path when we know the time the connecting edge appears. Note that by (|4.19p and Proposition I4.9[ we 
have, with Z\, Z 2 denoting independent standard normal random variables, and with Z = (Z\ + Z 2 )/V2, 
which is again standard normal, 

H n = Gg+i + G$ n -1 = 2(3 log a n + Z x V ' [3\oga n + Z 2 y/p\oga n + o T {^n) 

= 2 ( 31oga n + Z v / 2/31oga n + o P ( v / ioI^). (4.30) 

Finally, by fljTQ) , this gives ([331) and with 

2(3 log a n 

a = lim — , (4.31) 

logn 



n— »oo 



which equals a = vj(y — 1), when r > 3, since (3 = vj{y — 1) and = 1/2, and a = 2(r — 2)/(r — 1), 

when r £ (2, 3), since (3 = 1 and ^j^f = (r — 2)/(r — 1). This completes the proof for the hopcount. 

In the description of a in (|4.31|) . we note that when a n contains a slowly varying function for r £ (2, 3) 
as in (|4.26p , then the result in Theorem 13.21 remains valid with a log n replaced by 

2 log an = 2<yT ~ ^ log n + 2 log t (n) ■ (4.32) 
r — 1 

For the weight of the minimal path, we make use of (|4.20p and (I4.29|) to obtain in a similar way that 

W n - 2 7 log a n ^X 1 +X 2 . (4.33) 
This completes the proof for the weight of the shortest path. 



5 Proof of Proposition 14.31 
5.1 Proof of Proposition 14.3( a) 

We start by proving the statement for r £ (2,3). Observe that, in this context, di = Bi, and, by (|4.3|) . 
Bi + . . . + Bi = Si + i — 1, so that the sequence Bj/ (Si + i — 1), for j satisfying 1 < j < i, is exchangeable 
for each i > 1. Therefore, we define 

m o 

G m = Y,h, where P(I, = lK^}^) = * . (5.1) 
^ Oi + 1 - 1 

Thus, Jj is, conditionally on stochastically dominated by /j, for each £, which, since the sequences 

{Ii\i^L\ and {Ij}^!, conditionally on {-Bij^u each have independent components, implies that G m is 
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stochastically dominated by G m . We take G m and G m in such a way that G rn < G m a.s. Then, by the 
Markov inequality, for n m > 0, 



n\G r , 



Gm, fort 



< K^E[\G m - Gm\] = k^E [G m - G r 

m . 1 



i=l 



Bj(i-1) 

Si(Si + i-iy 



(5.2) 



i=l 



where, in the second equality, we used the exchangeability of Bj/(Si + i — 1), 1 < j < We will now 
show that 



X>[i/^; 



< oo, 



(5.3) 



i=l 



so that for any n m — ► oo, we have that P(|G m — G m | < K m ) — ► 1. We can then conclude that the 
CLT for G m follows from the one for G m . By [TJ1 (3.12) for s = 1], for r G (2,3) and using that 

= B\ + • • • Bi — (i — 1), where P(Bi > k) = k 2 ~ T L(k), there exists a slowly varying function i i— > 
such that E[l/5i] < ci(i)* -1 /0--2). when r G (2,3), we have that l/(r - 2) > 1, so that fl53D follows. 

We now turn to the CLT for G m , Observe from the exchangeability of Bj/ (Si + i — 1), for 1 < j < i, 
that for ii < Z2 < • • • < &fc) 



«[n 
Hn 

1 Z=2 



+ i{ - 1 



gj, 
5j, -Hz - 1 



E 



5, 



B, 



5 i( + i\ - 1 



Z=l 1 



(5.4) 



where we used that since Bi + . . . + Bj = Sj + j — 1, 

k 



E 



Bjj 



in 

(=2 



Bi 



S it + ij - 1 



ii ^ E [^ +ii - 1 n 

i=l 1 Z=2 



B, 



Si, +ii-l 



1 1=2 



Bi 



Si, +ii-l 



Since , . . . , Ii fc are indicators this implies that j^,...,/^ are independent. Thus, G m has the same 
distribution as X^I=i ^> where { Jj}^ 1 are independent Bernoulli random variables with P(Jj = 1) = 
It is a standard consequence of the Lindeberg-Levy- Feller CLT that (Y2iLi Ji ~ log m) / y/log m is 
asymptotically standard normally distributed. 

Remark 5.1 (Extension to exchangeable setting) Note that the CLT for G m remains valid when 
(i) the random variables {Bi\ 1 £ =1 are exchangeable, with the same marginal distribution as in the i.i.d. case, 
and (ii) X)i=i E^l/Si] = o(y / Iogm). 

The approach for r > 3 is different from that of r G (2, 3). For r G (2, 3), we coupled G m to G m and 
proved that G m satisfies the CLT with the correct norming constants. For r > 3, the case we consider now, 
we first apply a conditional CLT, using the Lindeberg-Levy- Feller condition, stating that, conditionally 
on Bl j B2, . . . satisfying 



lim 



5', 



00, 



we have that 



Cm _ Sj=l Bj/Sj d 



(EJLif(i-f)) 



1/2 



(5.5) 



(5.6) 
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where Z is standard normal. The result (|5.6p is also contained in [10j . 

Since v = > 1 and E[JB°] < oo, for any a < r — 2, it is not hard to see that the random variable 

Yl'jLi Bj/S] is positive and has finite first moment, so that for m — ► oo, 

m 

Y,B]/S% = O r (l), (5.7) 

where P (b m ) denotes a sequence of random variables X m for which | X m | / b m is tight. 
We claim that 

m 

y^Bj/Sj - - — -log to = o P (y / logm). (5.8) 
3=1 V 

Obviously, (|5.6p . (15. Tf) and (|5.8j) imply Proposition 14.3( a) when r > 3. 
In order to prove (|5.8|) . we split 



m m m 1 

E ^ " VZ\ lo § m = ( D B i " - log m) + ( 53 1/5, - — log to) , (5.9) 

i=i i=i i=i 

and shall prove that each of these two terms on the r.h.s. of (|5.9p is o v (\/Togm). For the first term, we 
note from the strong law of large numbers that 

S 4 



53 log (-i-) =lo g S m -logSo = logTO + P (l). (5.10) 

3=1 

Also, since — log (1 — x) = x + 0(x 2 ), we have that 



53 log (Sj/Sj-t) = - 53 log (i - (^ - 1)/^) = 53(5, - 1)/^ + 0(53^ - i?/s*). ( 5 .n) 
j=i j=i j=i j=i 

Again, as in (|5.7p . for to — > oo, 

m 

53(5, - l) 2 /^| = 0,(1), (5.12) 
i=i 

so that 

m 

53(5,- - 1)/^ - log to = Op(l). (5-13) 



In order to study the second term on the right side of (|5.9p . we shall prove a slightly stronger result than 
necessary, since we shall also use this later on. Indeed, we shall show that there exists a random variable 
Y such that 

m 1 

53 l/Sj - — - log to ^ Y. (5.14) 
i=i V ~ 

In fact, the proof of (|5.14|) is a consequence of [31 Theorem 1], since E[(_Bj — 1) log(-Bj — 1)] < oo for r > 3. 
We decided to give a separate proof of (|5.14p which can be easily adapted to the exchangeable case. 
To prove (I5.14p . we write 

53 m - iogm = 53 {v ~ 1)j ~ Si + aw, (5.15) 



3=1 
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so that in order to prove (|5.14p . it suffices to prove that, uniformly inra> 1, 

Thus, if we further make use of the fact that Sj > rjj except for at most finitely many j (see also Lemma 
IA.4I below) , then we obtain that 

m 1 -. m \ Q ( I'll m i q*\ 

|E^-^H^E%^+*m^e^, («■») 

j=l 3 j=l 3\ >■> j=l 

where Sj = Sj — E[Sj-], since E[«Sj] = (y — l)j + 1. We now take the expectation, and conclude that for 
any a > 1, Jensen's inequality for the convex function x i— > x a , yields 

E[\S*\] < E[|5*| a ] 1/a . (5.18) 

To bound the last expectation, we will use a consequence of the Marcinkiewicz-Zygmund inequality, 
see e.g. [UJ Corollary 8.2 on p. 152]. Taking 1 < a < r — 2, we have that E[|l?i| a ] < oo, since r > 3, so 
that 



E 



3=1 J 3=1 3 3=1 



This completes the proof of (|5.14p . ■ 

Remark 5.2 (Discussion of exchangeable setting) When the random variables {Bi} 1 ^ =1 are exchange- 
able, with the same marginal distribution as in the i.i.d. case, and with r > 3, we note that to prove a 
CLT for G rn , it suffices to prove |5. 7| ) and 15. 8\) . The proof of 115. 8\) contains two steps, namely, h5.13\) 
and Ii5.16\) . For the CLT to hold, we in fact only need that the involved quatities are o P (y/\og m), rather 
than O p (1). For this, we note that 

(a) the argument to prove \5. 13\) is rather flexible, and shows that if (i) log S m /m = o P (y/\og m) and if 
(ii) the condition in |5. ?| ) is satisfied with O p (1) replaced by o P (\/log m), then A5.13]) follows with O p (1) 
replaced by o v (^f\ogm); 

(b) for the proof of \5.16\) we will make use of stochastic domination and show that each of the stochastic 
bounds will satisfy \5. 1 6\) with O p (1) replaced by o P (y/logm) (compare Lemma 



5.2 Proof of Proposition IQKb) 

We again start by proving the result for r G (2, 3). It follows from (|4.6p and the independence of {Ei}i>i 
and {Si}i>\ that, for the proof of (|4.9p . it is sufficient to show that 

oo 

5^E[1/Si] < oo, (5.20) 

i=i 

which holds due to (|5.3p . The argument for T m is similar, with the same limit. Indeed, on the one hand, 
since T m < T m , the limit of T m cannot be larger than that of T m . On the other hand, since G m — > oo 
and m \— > T m is increasing, we have that the T m > T^, whp for any k. Therefore, T m must have the same 
limit as T m . 

The extension of this result to r > 3, where the weak limits of T m and T m are different, is deferred to 
Section [C] of the appendix. 
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6 Proof of Proposition 14.6 



In this section, we extend the proof of Proposition 14.31 to the setting where the random vector {Bi} 7 ^ =2 * s 
not i.i.d., but rather corresponds to the vector of forward degrees in the CM. 

In the proofs for the CM, we shall always condition on the fact that the vertices under consideration 
are part of the giant component. As discussed below (13. 3|) . in this case, the giant component has size 
n — o(n), so that each vertex is in the giant component whp. Further, this conditioning ensures that 
Sj > for every j = o(n). 

We recall that the set up of the random variables involved in Proposition 14.61 is given in (|4.12p and 
(I4.13|) . The random variable R m , defined in (|4.10j) . is the first time the SWG/j m consists of m + 1 real 
vertices. 

Lemma 6.1 (Exchangeability of {-Bi? m }^=i) Conditionally on{Di}f =1 , the sequence of random vari- 
ables {Bji^^Ii i s exchangeable, with marginal probability distribution 

PniBai=j) =± o + 'iv.^ (fU) 

i=2 n 1 

where P n denotes the conditional probability given {Z)j}" =1 . 

Proof. We note that, by definition, the random variables {Bji m }^L 1 are equal to the forward degrees 
(where we recall that the forward degree is equal to the degree minus 1) of a vertex chosen from all 
vertices unequal to 1, where a vertex i is chosen with probability proportional to its degree, i.e., vertex 
i £ {2, . . . ,n} is chosen with probability Pi = Di/{L n — D\). Let K2, ■ ■ ■ ,K n be the vertices chosen, 
then the sequence K2,...,K n has the same distribution as draws with probabilities {Pi}™ =2 without 
replacement. Obviously, the sequence (K2, ■ ■ ■ , K n ) is exchangeable, so that the sequence {Bn m } , ^L 1 , 
which can be identified as B^ m = Dx m+1 — 1, inherits this property. ■ 

We continue with the proof of Proposition 14.61 By Lemma 16.11 the sequence {-Bj}JL 2 is exchangeable, 
when we condition on |Artj| = for all j < m. Also, |Artj| = for all j < m holds precisely when 
R m = m. In Lemma I A. II in Appendix A, the probability that R mn = m n, for an appropriately chosen 
m n , is investigated. We shall make crucial use of this lemma to study Grn n • 

Proof of Proposition [4761 Recall that by definition log(m n /a n ) = o(y / logn). Then, we split, for some 
m n such that log(a n /m n ) = o( v / Iogn), 

Gm n = Grn n + [Gm„ — G^J, (6.2) 

where Gm n has the same marginal distribution as Gm n , but also satisfies that G m ^ < G„„, a.s. By 
construction, the sequence of random variables m \— > G m is stochastically increasing, so that this is 
possible by the fact that random variable A is stochastically smaller than B if and only if we can couple 
A and B to (A, B) such that A < B, a.s. 

Denote by A m = {R m = m} the event that the first artificial stub is chosen after time m. Then, by 
Lemma [A. H we have that P(v4^ ) = o(l). Thus, by intersecting with Am n and its complement, and then 
using the Markov inequality, we find for any c n = o(^/logn), 

F(|GW n - G m J > cn) < -E[|GW„ - G^JUJ + o(l) (6.3) 

= ^[[G mn -G I!kn }t Amn ]+o{l) 



n ■ 11 



+ o(l). 
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We claim that 



E E 

i=m„+l 



n 



(6.4) 



Indeed, to see (|6.4p . we note that Bi = 0, when i ^ Rj for some j. Also, when A m occurs, then 
R m = m n . Thus, using also that R m > m, so that Ri < fn n implies that i < m n , 



E E 



i=m_+l 



m n 

< E E 



h 



i - 1 



-E 



g -{m n +l<-Ri<m n } 
]1 {nin+ 1 <^<" T «} 



the latter following from the exchangeability of {B Ri }™ =2 , because 

Ri i 

S Ri =D X + J^iBj -1)=D 1 + J2 B Rj - {Ri - 1), 

3=2 j=2 

so that 

i 

B R 3 = S R> -Dx + Ri-IK S Ri + Ri. 

3=2 

In Lemma lA.2l of the appendix we show that there exists a constant C such that for i < fn n , 



E 



so that, for an appropriate chosen c n with log (rn n /m n )/c n — > 0, 



(6.5) 



(6.6) 



(6.7) 



Grn n ~ G m J >C n j<— ^2 



C 



1 C log (m ra /m J 

< ; = o(l), 



i=m„+l 



i - 1 



(6.8) 



since log (rn n /m n ) = o(yl"ogn). Thus, the CLT for Gm„ follows from the one from Gm n , which, since the 
marginal of G m?i is the same as the one of G m , follows from the one for G m . By Lemma [A. 1\ we further 
have that with high probability, there has not been any artificial stub up to time m n , so that, again with 
high probability, {i? m }^™ 2 = {^Kmlm=2' * ne latter, by Lemma [6TTT being an exchangeable sequence. 

We next adapt the proof of Proposition 14.31 to exchangeable sequences under certain conditions. We 
start with r £ (2,3), which is relatively the more simple case. Recall the definition of G m in (|4,13p . We 
define, for i > 2, 



S i = ^B j = S i + i-l-D 1 . 

3=2 

Similarly to the proof of Proposition 14.31 we now introduce 



(6.9) 



Gn 



l + jHli, where P(I; = l|{Bi}£ 2 ) = 2 < i < m. 



(6.10) 



i=2 



Let Qi = Bi/Si,Qi = Bi/S%. Then, by a standard coupling argument, we can couple L L and Ii in such a 



way that P(/^ 



m \ 
i=2> 



\Qi Qi\- 
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The CLT for G m follows because, also in the exchangeable setting, I2, . . . , J m are independent and, 
similar to (15.21). 



m m 

'(|G m - G m \ > k„) < ^EflG^ - G m \] < k^EIJ^ \h ~ h\] = < X>[|4 - Q»| 



i=l i=2 



Si Si 



£>! + (<-!)- 



i=2 i=2 
m 



= ^ E jh ^^s'^ } = ^ E ( E [v*] + -^[zv^]) 

i=2 1 i=2 

m ^ 

< K" 1 (nV(Si -D l + 2)] + _E[2V(Si - Di + 2)]) , (6.11) 



where we used that D\ > 2 a.s. We take m = m n , as discussed above. Since D\ is independent of 
Si — D\ + 2 for i > 2 and E[Di] < 00, we obtain the CLT for G m from the one for G m when, for 
re (2,3), 

™n i 

^E[1/SJ = O(l), where 2^ = 1 + ^(^-1), i > 1. (6.12) 

i=l j=2 

In Lemma [A. 21 of the appendix we will prove that for r £ (2, 3), the statement f|6. 12j) holds. The CLT for 
Gji— n follows in an identical way. 

We continue by studying the distribution of T m and T m , for r £ (2,3). We recall that T m = 
YliLi Ei/ Si, (see (|4.6p ). In the proof of Proposition 14.31 (b) for r £ (2,3), we have made crucial use 
of (|5.20p . which is now replaced by (|6.12p . We split 

m n p m 

T m = Y, Ei/ Si = Et/Si + Ei/Si. (6.13) 

i=l i=l i>nP 

The mean of the second term converges to for each p > by Lemma IA.21 while the first term is by 
Proposition 14.71 whp equal to Ya=i Ei/S^ ad \ where Sj- = Y^j=iBj\ and where B^ nd) = D\, while 
{_Br nd) }^ 2 is an i.i.d. sequence of random variables with probability mass function g given in (|2.3|) . which 
is independent from D\. Thus, noting that also Y^i> n p Ei/ S^ ud) — > 0, and with 

00 

X = Y,Ei/S!/ nd \ (6.14) 

i=l 

we obtain that T rn — — > X. The random variable X has the interpretation of the explosion time of the 
continuous-time branching process, where the degree of the root has distribution function F, while the 
degrees of the other vertices is an i.i.d. sequence of random variables with probability mass function g 
given in (|2.3|) . A similar argument holds for T m , with X = X. This completes the proof of Proposition 



2] for r £ (2, 3), and we turn to the case r > 3. 

For r > 3, we follow the steps in the proof of Proposition 14.3( a) for r > 3 as closely as possible. 
Again, we apply a conditional CLT as in (|5,6p . to obtain the CLT when (|5,5p holds. From Lemma lA.5l we 
conclude that (|6.7p also holds when r > 3. Hence, as before, we may assume by Lemma |A. lj , that whp, 
there has not been any artificial stub up to time m n , so that, again whp, {fi m }~!L 2 = {-B_R m }m=2' the 
latter, by Lemma EH being an exchangeable sequence. For the exchangeable sequence \B rn \^L 2 we wn l 
then show that 

m. n 

5>?/S? = 0,(l). (6.15) 

i=2 



23 



The statement (|6.15p is proven in Lemma IA.6I 

As in the proof of Proposition 14.3( a). the claim that 

^2 B i/ S i ~ 7— [ log m n = Op(yiogmJ, (6.16) 
i=2 

is sufficient for the CLT when r > 3. Moreover, we have shown in Remark 15.21 that (|6.16j) is satisfied, 
when 

log {S m Jm n ) = o P ( v / logm n ), (6.17) 

and 

^ %~^-l)j J = (6-18) 

The proof of (|6.17p and (|6.18p are given in Lemmas IA.7I and Lemma IA.81 of Appendix A, respectively. 
Again, the proof for Gr— u is identical. ■ 

For the results for T m and T m for r > 3, we refer to Appendix O 



7 Proof of Proposition 14.91 

In this section, we prove Proposition 14.91 We start by proving that logC n /a n = o P (\/logn), where C n 
is the time at which the connecting edge appears between the SWGs of vertices 1 and 2 (recall (|4.17|) ). 
as stated in (|4.27p . As described in Section [4.41 we shall condition vertices 1 and 2 to be in the giant 
component, which occurs whp and guarantees that Sm > for any m = o(n) and i G {1,2}. After 
this, we complete the proof of (|4.28p - (|4.29p in the case where r G (2,3), which turns out to be relatively 
simplest, followed by a proof of (|4.28|) for r > 3. The proof of (|4.29p for r > 3, which is more delicate, is 
deferred to Appendix C. 

We start by identifying the distribution of C n . In order for C n = m to occur, apart from further 
requirements, the minimal stub from SWG^ must be real, i.e., it may not be artificial. This occurs with 
probability equal to 1 — |Art^|/5m . 

By Construction [3~4l the number of allowed stubs incident to SWG^ equals S m , so the number 
of real stubs equals Sm — |Art^|. Similarly, the number of allowed stubs incident to SWG^ equals 
Sm > so the number of real stubs equals Sa,l — |Art^| Further, the number of free stubs equals |FS m | = 
L n — a n — m — S m + |Art m |, where we recall that S m = Sa„ + Sm and Art m = Art £2 U Art^, and is 
hence bounded above by L n and below by L 

n Q"n in Sm- W^hen the minimal- weight stub is indeed 
real, then it must be attached to one of the real allowed stubs incident to SWG^, which occurs with 
conditional probability given SWG^ ,2) and L n equal to 

San I ArtL ^1 

^ l n (7 ^\ 

Ln a n m S m + |Art m | 

Thus, in order to prove Proposition 14.91 it suffices to investigate the limiting behavior of L n , S m and 
|Art m |. By the law of large numbers, we known that L n — fin = o P (n) as n — > oo. To study Sm and |Art m |, 
we shall make use of results from \2A\ [25] . Note that we can write S# = A + Bj? + • • • + B% - ( m — 1), 
where {Bm }„ =2 are close to being independent. See (2U Lemma A. 2. 8] for stochastic domination results 
on {B m ] }™ =2 and their sums in terms of i.i.d. random variables, which can be applied in the case of r > 3. 
See \Zb\ Lemma A. 1.4] for bounds on tail probabilities for sums and maxima of random variables with 
certain tail properties. 

The next step to be performed is to give criteria in terms of the processes S m which guarantee that 
the estimates in Proposition 14.91 follow. We shall start by proving that with high probability C n > m n , 
where m n = e n a n , where e n { 0. This proof makes use of, and is quite similar to, the proof of Lemma 
I A. II given in Appendix A. 
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Lemma 7.1 (Lower bound on time to connection) Let m n /a n = o(l). Then, 

HCn < mj =0(1). 

Proof. Denote 



(7.2) 



£n = {S™ < a n M n }, (7.3) 

where M n = r/" 1 for r > 3, while M n = 7/~ 1 n( 3 ~ r )/' r ~ 1 ) for r G (2, 3), and where r/ n J, sufficiently slowly. 
Then, by ([Al^) for r > 3 and fOo]) for r G (2, 3), 



P(^) = o(l), 
since m n = o{a n ). By the law of total probability, 

m=l 



(7.4) 



(7.5) 



Then, we make use of (|7.ip and (|7.4p . to arrive at 



P(C„<mJ < ^E[l £ 



<_,(. 



(1) 



m=l 



L n — m — S n 



+ o(l). 



(7.6) 



As in the proof of Lemma [A. H we have that m < m n = o(n) and S rn = o(n), while L n > n. Thus, (17.6 
can be simplified to 



P(C„ < mj < i±^lm n E[%„5«] + o(l) < i±^a n m n . 
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(7.7) 



When choosing % J, sufficiently slowly, for example as r] n = \/m n /a n , we obtain that P(C n < m n ) = o(l) 
whenever rn n = o(a n ). m 

We next state an upper bound on C n : 

Lemma 7.2 (Upper bound on time to connection) Letrn n /a n — ► oo, then, 

P(C n > fn n ) = o(l). (7.8) 

Proof. We start by giving an explicit formula for P(C n > m). As before, Q^™' is the conditional 
distribution given SWG^ ,2> and {Di}f =1 . Then, by Lemma lB~Tj 



P(C n > m) = E[ JlQ«(C n > j|C n >j - 1) 

3=1 



(7.S 



Equation (|7.9|) is identical in spirit to \24\ Lemma 4.1], where a similar identity was used for the graph 
distance in the CM. Now, for any sequence e n — ► 0, let 



^ = {^|Art£| + ^)£ 



I Art 



(■j) I 



m=l 



V2> 



•}■ 



(7.10) 



By Lemma lB.3( the two terms appearing in the definition of B n in ()7.10p converge to zero in probability, 
so that P(£>n) = 1 — o(l) for some e n — > 0. Then, we bound 



P(C n > m) < E[l Sn Q«(C n > j\C n > j - 1)] + F(B c n ). 
i=i 



(7.11) 



25 



We continue by noticing that according to (|7.ip . 

Q^ ) (Cn = m|C n >m-l) 



IArt«| 



|FS r 



1 



| Art, 



(2)1 

m 



Q(2) 
>Jrn 



(7.12) 



where |FS m | is the number of real free stubs which is available at time m. Combining (|7.1ip and (|7.12p 
we arrive at 



p(c„ > m n ) 



E 



m n _ I A -+(1)1 

^ n f 1 - - ( 



m=l 



IFS 



in 



I Artffi 
q(2) 



+ o(l). 



(7.13) 



Since |FS m | < L n < n/c, whp, for some c > 0, and using that 1 — x < e x , we can further bound 

|Art< 



where 



P(C n > fn n ) < E 
< E 

e n = 0(E 



l Sn exp 5«-|Art«| £ 1 



m=l 



|-(2) 



q(2) 



}] +o(l) 



l Sn exp{-^5«}]+ en + (l), 



1ft, 



^" [Art^'l + -S^ 



^ | Art 



(2) 



m=l 



s, 



(2) 



0(e r 



Hence, 



cxp 



cm, 



S' 1 



n 



2}] +«(!)■ 



P(C n > m n ) < E 

When r > 3, by Lemma IA.4I in the appendix, we have that, whp, and for some rj > 0, 
so that 



P(C n > m n ) < exp { 



cr)a n m n 
n 



} + o(l) = (l), 



(7.14) 



(7.15) 



(7.16) 



(7.17) 



(7.18) 



as long as m n a n /n = m n j\fn — > oo. For r G (2, 3), by (|A.39P in Lemma [A.31 and using that n 1 /^" 1 )/^ 
l/a n , we have for every e n — > 0, 



P(C n > m n ) < exp{ ^} + o(l) = o(l), 



(7.19) 



whenever e n m n /a n — > oo. By adjusting e n , it is hence sufficient to assume that Tn n ja n — > oo. ■ 

Lemmas 17.11 and 17.21 complete the proof of H4.27H in Proposition 14.91 We next continue with the proof of 
(|4.28p in Proposition 14.91 We start by showing that P(C n = 0) = o(l). Indeed, C n = happens precisely 
when 2 € SWG^, which, by exchangeability, occurs with probability a n /n = o(l). 

For C n > 1, we note that at time C n , we draw a real stub. Consider the pair (G a ^ +l , GqI) conditionally 
on {C n = m} for a certain m. The event {C n = m} is equal to the event that the last chosen stub in 
SWG^ is paired to a stub incident to SWG^, while this is not the case for all previously chosen stubs. 



(i) 



For j = 1, . . . , m, and i G {1, 2}, denote by /j l> the j th real vertex added to SWG W , and denote by V r 
the number of real vertices in SWG^. Then, for m > 1, the event {C n = m} is equal to the event that 
the last chosen stub in SWG^ is paired to a stub incident to SWG^ 1 ' , and 



Uflji n{if}]£ 

As a result, conditionally on {C n , = m} and Va^ = ^1, ^m 2> = 

2) \fe2 

?j 2 J h 



0. 



(7.20) 



/c2) the vector consisting of both {B^ }j\=\ 



and {Bft\ is an exchangeable vector, with law is equal to that of k\ + ki draws from {D{ — 1} 



i=3 
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without replacement, where, for i G [n]\{l, 2}, Di — 1 is drawn with probability equal to Di/(L n — D±— D 2 ). 
The above explains the role of the random stopping time C n . 

We continue by discussing the limiting distributions of (H n , Hn ) in order to prove (I4.28p . For this, 
we note that if we condition on {C n , = m} for some m and on SWG^ ,2) , then, by (|4.19|) (Hn \ H^) = 
(G^ +1 ,Gm), where the conditional distribution of (G^ +1 ,Gm) is as two independent copies of G as 
described in (jOj) . where {dj}"^ in (jO|) is given by di = D 1 and dj = Bf \ j > 2, while, = G$ , 
where d\ = D2 and dj = B^, j > 2. Here, we make use of the fact that is the distance from vertex 
1 to the vertex to which the paired stub is connected to, which has the same distribution as the distance 
from vertex 1 to the vertex which has been added at time a n + 1, minus 1, since the paired stub is again 
a uniform stub (conditioned to be real). 

Thus, any possible dependence of (Hn , H n 2) ) arises through the dependence of the vectors {i?j 1> }^ =2 
and {B { -p}JL 2 - However, the proof of Proposition 14.61 shows that certain weak dependency of {Bj 1) } < ^ =2 
and {B { p}f =2 is allowed. 

We start by completing the proof for r G (2,3), which is the more simple one. Recall the split in 
(|5.ip . which was fundamental in showing the CLT for r G (2,3). Indeed, let and {/j 2) }^2 :1 be 

two sequences of indicators, with l[ r> = 1{ 2) = 1, which are, conditionally on {Bf ] }f =2 and {Bj 2) }?L 2 , 
independent with, for i G {1,2}, 

P(/f = l\{Bf}f =2 ) = Bf/(Sf+j - 1 - Di). (7.21) 

Then, the argument in (|5.4p can be straightforwardly adapted to show that the unconditional distributions 
of Uj 1 '}^ 2 and {If ) }f=2 are that of two independent sequences {Jf ] }f =2 and {Jf ] }f =2 with P(jj l) = 
1) = l/(j — 1). Thus, by the independence, we immediately obtain that since C n — ► 00 with log(C n /a n ) = 
o P (yiogn), 

loga„ H^-(3\oga n \ d 



(Zi,Z 2 ). (7.22) 

V/3 log a n V/3 log a n / 

The argument to show that, since C n < m n , (Hn \ H^) can be well approximated by (Gm , Gm ) (recall 



(16. 2p ) only depends on the marginals of (Hn\ Hn ), and thus remains valid verbatim. We conclude that 
IgZggD holds. 

We next prove (|4.29p for r G (2,3). For this, we again use Proposition 14.71 to note that the forward 
degrees {Bj}™^ can be coupled to i.i.d. random variables {-B$ md) }™^3, which are independent from B\ = 
D±,B 2 = D 2 . Then we can follow the proof of Proposition 14.6( b) for r G (2,3) verbatim, to obtain that 

(Ta^,T^) (Xi,X 2 ), where X±,X 2 are two independent copies of X in (|6.14p . This completes the 
proof of Proposition 14.91 when r G (2, 3). 

We proceed with the proof of Proposition l4.9l when r > 3 by studying (H n : H n )■ We follow the proof 
of Proposition 14.6( a). paying particular attention to the claimed independence of the limits (Zi,Z 2 ) in 
(|4.28p . The proof of Proposition 14. 6f a) is based on a conditional CLT, applying the Lindeberg-Levy-Feller 
condition. Thus, the conditional limits (Zi,Z 2 ) of 



HP - E-=2 Hf } /Sf H n 2) - Bf/S™ 



( a B (1) B (1) \l/2' / B ( 2 ) B V) \l/2 



(7.23) 



are clearly independent. The proof then continues by showing that the asymptotic mean and variance can 
be replaced by (3 log n, which is a computation based on the marg inals {Bf ] }f =2 and {Bf ] }f =2 only, and, 
thus, these results carry over verbatim, when we further make use of the fact that, whp, C n G \jn n ,rn n ] 
for any m n ,m n such that log (rn n /m n ) = o(i/log n). This completes the proof of (j4.28j) for r > 3. The 
proof of (|4.29p for r > 3 is a bit more involved, and is deferred to Section [Cl 
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A Appendix: auxiliary lemmas for CLTs in CM 

In this appendix, we denote by B\ = D\, the degree of vertex 1 and B 2 , ■ ■ ■ ,B m , m < n, the forward 
degrees of the shortest weight graph SWG m . The forward degree B^ is chosen recursively from the set 
FSfc, the set of free stubs at time k. Further we denote by 

k 

3=2 

the number of allowed stubs at time k. As before the random variable R m denotes the first time that the 
shortest path graph from vertex 1 contains m + 1 real vertices. Consequently 

Br 2 , . . . , B Rm , 

m < n, can be seen as a sample without replacement from the degrees 

D 2 -l,D 3 -l,...,D n -l. 

A.l The first artificial stub 

We often can and will replace the sample B 2 , . . . , Bm n , by the sample Br 2 , . . . , B^ . The two samples 
have, whp, the same distribution if the first artificial stub appears after time m n . This will be the content 
of our first lemma. 

Lemma A.l (The first artificial stub) Letm n /a n — > 0. Then, 

nRrn n > ELn) = o(l). (A.l) 

Proof. For the event {Rm n > EL n } t° happen it is mandatory that for some m < m n , we have R m > m, 
while Rm-i =m — l. Hence 

F n (R mn >m n ) = Y J ^n(R m > m, i? m _i = m - 1). (A.2) 

m=2 

Now, when R m > m, R m -i = m — 1, one of the <S m -i stubs incident to SWG m _i has been drawn, so that 

F n (R m > m,R m _ x = m- 1) = E n [- ^ m " 1 , H {flm _ 1=m -i } ] ■ (A.3) 

Since m n = o(n), we claim that, with high probability, S m -\ = o(n). Indeed, the maximal degree is 
O r (n 1 ^ T ~ 1 ^), so that, for m < m n , 

S m < Ovimn 1 '^) < P (rn n n 1/(r - 1} ) = o w {n), (A.4) 
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since, for r > 3, a n = n 1 / 2 and n 1 ^ T ~ 1 ^ = o(n 1 / 2 ), while, for r G (2,3), a n = n (T_2 ^ (T_1) , so that 
m n n l '^ r ~ 1 ' = o(n). Moreover, L n > n, so that 

C 

Pn(-Rm > m,i? m _i = m - 1) < — E n [5 m _ill{^ _ 1=m _i}]. (A.5) 

By the remark preceding this lemma, since R m —\ — m — 1, we have that S m —i — D\ -\- ^2j = 2 (^i?^ — l) ? 
so that, by Lemma I67T] 

P„(ii m > m, i? m _i = m - 1) < -D x + ^- E n [B R2 ]. (A.6) 

n n 

The first term converges to 0, while the expectation in the second term, by (|6,ip . equals 

^-t^T- (A ' 7) 

When r > 3, this has a bounded expectation, so that, for a n = y/n, 

Sin p ?Hn pi _ n\ 2 

¥(R mn >m n ) < V-E^l + V-^ ^E[B fl2 ] < - 0. (A.8) 



n ' — ' n n 

m=2 m=2 

When r G (2, 3), however, then E[L> 2 ] = 00, and we need to be a bit more careful. In this case, we obtain 
from fAbl that 

9 

ID 

V n (Rm n > mj < C^E n [5 fl2 ]. (A.9) 
— n 

From (|A.7p . and since L n — D\ > n — 1, 

e„[^ 2 ] < -°— aca - 1) < -^r E ( A - 10 ) 

n — 1 ^-^ n — 1 ^-^ 

i=2 i=2 

From (|3.8p . we obtain that x^ r_1 ^ 2 P(D 2 > x) G [01,02] uniformly in x > 0, and since Di,D%, . . . ,D n 
is i.i.d., we can conclude that n" 2 /^" 1 ) YTi=2 D i 

converges to a proper random variable. Hence, since 

a n/lR n -»oowe obtain, whp, 

E n [B Ro ] < ^n 2 /^ 1 )" 1 = ^LnO-^/^-i). (A.ll) 

Combining (jAT9|) and (jATTT) . and using that a n = n^ -2 )/^ -1 ) we obtain that, whp, 

^n{Rrn n > rn n ) < Ca n m n n^~ l = C^n^ 1 ^- 1 = = o P (l). (A.12) 

Cl n CL n 

This proves the claim. ■ 

A. 2 Coupling the forward degrees to an i.i.d. sequence: Proposition 14.71 

We will now prove Proposition 14.71 To this end, we denote the order statistics of the degrees by 

D W <D (2) < ...<D (n) . (A.13) 

Let m n — > 00 and consider the i.i.d. random variables Xy, X-,, . . . , X_ m , where X^ is taken with replace- 
ment from the stubs 

D {1) -1,D W -l,...,D (n _ mn) -1, (A.14) 
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i.e., we sample with replacement from the original forward degrees D\ — 1, D2 — 1, . . . , D n — 1, where the 
m n largest degrees are discarded. Similarly, we consider the i.i.d. random variables X±, X2, ■ ■ ■ , X mn , 
where X, is taken with replacement from the stubs 

^(m„+l) ~~ 1) D (ron+2) ~~ 1) • • • i ^(«) ~~ 1) (A. 15) 

i.e., we sample with replacement from the original forward degrees D\ — 1, D2 — 1, ... , D n — 1, where the 
m n smallest degrees are discarded. Then, obviously we obtain a stochastic ordering X_ { < st Bi < st Xi, 
compare \24\ Lemma A. 2. 8]. As a consequence, we can couple {-B^}™'^ to m n i.i.d. random variables 
{2[i}S. _1 , (Xij^X 1 such that, a.s, 

X x _ x <B t < X^. (A.16) 

The random variables {X i } 7 ^I l 1 ~ 1 , as well as {Ai}™^ -1 are i.i.d., but their distribution depends on m n , 
since they are draws with replacement from D\ — 1, ... , D n — 1 where the largest m n , respectively smallest 
m n , degrees have been removed (recall (jA. 14h ) . Let the total variation distance between two probability 
mass functions p and q on N be given by 

^ 00 

d T v(p,q) = ^^\Pk-Qk\- (A. 17) 

k=0 

We shall show that, with g and ~g, respectively, denoting the probability mass functions of X_ i and Xi, 
respectively, there exists p' > such that whp 

d TV (g {n) ,g)<n~P\ d TV (g w , g) < n- p ' . (A. 18) 

This proves the claim for any p < p' , since (|A.18P implies that d TV (g'- n \'g^) < 2n~ p ' , so that we can 
couple {Aj^Tf 1 and {Aj^f 1 in such a way that P({AJ™" 1 = {AJ^) < 2m n n-p' = o(l), when 
"m-n = n p with p' < p. In particular, this yields that we can couple {-Bj}™ n 2 to {A,)™^" 1 in such a way 
that {5i}™ n 2 = {AJ™ 7 ^ 1 wh P- Then, again from (|A.18j) . we can couple {AJ^ -1 to a sequence of i.i.d. 
random variables {Bf^}^ 1 such that {Aj™"f 1 = {Bf^}?^ -1 whp. Thus, (IAT81) completes the 
proof of Proposition 14.71 

To prove (1A.18|) . we bound, 

d TV (g (n \g) < d TV (g^\g^) + d TV (g^,g), (A. 19) 

and a similar identity holds for d TV 0? ( ™\ g), where 



9k' = ^iZ^ k + l ) 1 {D 3 =k + iy (A.20) 



- — 11 ■ 1 

in |24[ (A. 1.11)], it is shown that there exists 0.2,^2 > such that 

¥(d TV (g (n) ,g) > n" Q2 ) < n~^. (A.21) 
Thus, we are left to investigate d TV (g < - n) , g {n) ) and d TV (g (n) , <7 (n) ). We bound 



1 00 

dxv(9 ( " ) ,5 ( " ) ) = ^El£ (n; 



fc=0 



9k 

j j n—m n n 

<E( fc + 1 )(r-r) S v^+i}+E( fe + 1 )^- E 

fc=0 — n n i=l fc=0 — n j=n-m n +l 

r . n-m n 1 n n 

^(tt)i:».4 e >^(^)=t-. e >. < a -> 



i=l j=n-m n +l j=n-m n +l 
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where L n = ^JiJi' n . Define b n = Q(n/m n ) 1 ^ T ^. Then, from 1 — F(x) = x ( T ^L(x), and 
concentration results for the binomial distribution, we have, whp, D (n _ mn+1) > b n , so that, whp, 



L n ~ L.n 



E ^^rt D iVi>M- (A. 23) 



J J ^ W — J 

- n ~ n j=n-m n + l ~ n j=l 



Now, in turn, by the Markov inequality, 

F (-rJ2 D ^>bn}>^b 2 - T ) <n- £ 6;- 2 E[— ^^dl {D ,> M ] <Cn- £ , (A.24) 

so that 

F(d TV (g^\g^) > n £ 6"(- 2 )) = o(l). (A.25) 

Thus, whp, d TV (g < - n \ g^) < n £ (m n /n)( T ~ 2 ^( T ~ 1 \ which proves (|A. 18|) when we take m n = n p and 
p' = (1 — /o)(r — 2)/(r — 1) — e > 0. The upper bound for d TV (g { "\ g {n) ) can be treated similarly. ■ 

A. 3 Auxiliary lemmas for 2 < t < 3 

In this section we treat some lemmas that complete the proof of Proposition 14.6( a) for r £ (2,3). In 
particular, we shall verify condition (ii) in Remark 15.11 



Lemma A. 2 (A bound on the expected value of 1/Si) Fix r S (2,3). For m n ,m n such that 
log (a n I m n ) , log (m n /a n ) = o(\/log n) and for b n such that b n — > oo, 



(z)^E[l/£;] = 0(1), (ii) = o(l), and (in) sup E[(^/5 R J]l { ^ +1 < iJi <^ } ] < oo. 

i=i i=b n 

(A.26) 

Proof. Let m n = o(a n ). Let 



Mi = max (Sj - 1). (A.27) 



Then, we use that, for 1 < i < m 



Ei = 1 + ~ X ) - max (^J - !) - (« - 2 ) = M « - - 2 )- (A.28) 

3=2 2 " i - i 

Fix S > small, and split 

E[l/Si] < E[l/S,i {Si < 4l+n ] +E[l/S i l {Si> , 1+n ] < P(Si < i 1+5 ) + r( 1+5 ). (A.29) 

Now, if Si < i 1+s , then M» < i 1+8 + i < 2i 1+<5 , and Ej < i 1+s + i < 2i 1+8 for all j < i. As a result, for 
each j < i, the conditional probability that Bj — 1 > 2i 1+l5 , given Sj_i < 2i 1+5 and {-D s }™ =1 is at least 

-i n n 

— J2D s l {Ds>2i i +S} > 2i 1+s Y J hD s >2i^}/L n = 2i 1+s Bm(n,l-F(2i l+5 ))/L n . (A.30) 

^ U 3=1 S = l 

Further, by (jlTS]) . for some c> 0, n[l - F(2i 1+<5 )] > 2crai-( 1+<5 )( r ~ 1) , so that, for i < m n = o(n (T_2)/(T - 1) ), 
(i+tf)(T— l) > n e £ or some ^ > 0. We shall use Azuma's inequality, that states that for a binomial 
random variable BIN(iV,p) with parameters N and p, and all t > 0, 

P(BIN(iV,p) < JVp-t) < expj-^j . (A.31) 
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As a result, 

p(BIN(n, 1 - F{2i 1+5 )) < E[BIN(n, 1 - F(2i 1+<5 ))]/2) < e -™[i-^(2* 1+ *)]/2 < e -n^ 

so that, with probability at least 1 — e _n " , 

1 n 

-Y^D s l {Ds>2ll+ s } > c^ l+& ^ 2 \ (A.33) 



s=l 



Thus, the probability that in the first i trials, no vertex with degree at least 2i 1+s is chosen is bounded 
above by 

'l_cr(l+«)(r-2)y +e -n« ^e-ciMi+flC-fl +e _ n . j (A _ 34) 



where we used the inequality 1 — x < e x , x > 0. Finally, take 5 > so small that 1 — (1 + 5) (r — 2) > 0, 
then we arrive at 

E[l/S*] < + e-- 1 -^" 2 ' + e ~ n \ (A.35) 

which, when summed over i < m n , is O(l). This proves (i). For (ii), we note that, for any b n — > oo, the 
sum of the r.h.s. of (jA.35|) is o(l). This proves (ii). 

To prove (iii), we take log (a n /m n ), log (m n /a n ) = o(y/log n). We bound the expected value by 

m n E[(l/S' iJi )l {mri+1 < i?i < mn} ]. 

For m n + 1 < i < rn n , 

i i 

S i = D 1 + J2(Bj - 1) > 1 + J2( B i ~ !) = ^ ( A - 36 ) 

and the above derived bound for the expectation E[l/£j] remains valid for m n + 1 < i < fn n , since also 
for i < m n , we have ni~^ l+5 ^ T+l ^ > n £ ; moreover since the r.h.s. of (|A.35|) is decreasing in i, we obtain 

l-i , r\ 1-(1+S)(t-21 , 

E [1/Si] < m~ (1+5) + e" c ^ + e" n . (A.37) 



Consequently, 

/ it , r\ 1-(1 + <5)(t-2) 

m„E[(l/S i?i )l {mri+1 <^< 7?Tn} ] < m„(m-( 1+5 ) + e" c ^ + e~" J = o(l), (A.38) 

using that log (a n /m n ), log (m n /a n ) = o(\/logn). This proves (iii). ■ 
Lemma A. 3 (Bounds on £W n ) t G (2,3). T/ien, whp, /or e^ery e n — ► 0, 

S an >e„n 1/(T " 1) , (A.39) 

while, whp, uniformly for all m < m n , 

< e-W 3 -^- 1 ). (A.40) 
Proof. We prove (|A.39j) by noting that, by (|A.33p and the fact that e n [ 0, 

^ — ( T — 2 ) 

E^ViW/t-)} ^ cn(e n n 1 /(-D) = ( C n/a n )^- 2 ). (A.41) 

i=l 

Therefore, the probability to choose none of these vertices with degree at least e n n 1 ^ T ~ 1 ^ before time a n 
is bounded by 

(1 - ce-(r-2) n -(2-r)/(T-l))0„ < e -^ (r " 2) = ^ ( A _ 42 ) 
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for any e n j 0. In turn, this implies that, whp, S Qn > e n n 1 ^ T ~ l ^ > — a n > e n n 1 ^ T ~ 1 ^ /2, whenever e n is 
such that e n n l ^ T ~ 1 ^ > 2a n . 

To prove (|A.40|) . we use that, whp, D (n) < e~ l n l ^ T ~ l '> for any e n — ► 0. Thus, whp, using the 
inequality L n > n, 

n 

E n [S m ] < mE n [B 2 ] <^J2 D ^ D i ~ ^{D;^ V/C-D}- (A.43) 

i=i 

Thus, in order to prove the claimed uniform bound, it suffices to give a bound on the above sum that 
holds whp. For this, the expected value of the sum on the r.h.s. of (|A.43P equals 

<c 2 n V f-* < -SZ-ne-^nWtr-V. (AM) 

3 T 

3=1 

Since r E (2, 3), £,^ -2 — ► oo, so that uniformly for all m < m n , by the Markov inequality, 

< c 2 £- (2 - r) = o(l). (A.45) 

This completes the proof of (|A.40|) . ■ 
A. 4 Auxiliary lemmas for r > 3 

In the lemmas below we use the coupling (]A.16p . We define the partial sums and Si by 

i-l i-l 
£ = 5 i = X)(X i -l), i>2 (A.46) 

j=i j=i 

As a consequence of ()A.16p . we obtain for i > 2, 

i 

^ < Y, i B i - !) ^ S^ a.s. (A.47) 

i=2 

Lemma A. 4 (A conditional large deviation estimate) Fix r > 2. Then whp, there exist a c > 
and r\ > sufficiently small, such that for all i > 0, and whp, 

^n{S_i < 7]i) < e~ c \ (A.48) 

The same bound applies to Si. 

Proof. We shall prove (|A.48p using a conditional large deviation estimate, and an analysis of the 
moment generating function of Xy, by adapting the proof of the upper bound in Cramer's Theorem. 
Indeed, we rewrite and bound, for any t > 0, 

Pn(£i < ¥) = P»(e-^* > e-**) < (e'Vn(t))*, (A.49) 

where 4> n (t) = E n [e~*(— is the (conditional) moment generating function of X_y — 1. Since X_ 1 — 1 > 0, 
we have that e~ t<y — < 1, and X_y B> where B has the size-biased distribution in (|2.3p . Therefore, 
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for every t > 0, cft n (t) — > 4>(t), where cp(t) = E[e _t ^ -1 ^] is the Laplace transform of B. Since this limit 
is a.s. constant, we even obtain that (ft n (t) 4>{t). Now, since K[B] = v > 1, for each < n < E[_B] — 1, 
there exists a t* > and e > such that e _r ^ '(j)(t*) < 1 — 2e. Then, since e r ^ '4>n(t*) — > e l * v (p(t*), whp 
and for all n sufficiently large, |e** T, n (£*) - e** ?? ^(t*)| < e, so that e~** ,? (/) n (t*) < 1 — e < 1. The proof for 
follows since Si is stochastically larger than 5^. This completes the proof. ■ 



Lemma A. 5 Fix r > 3. For m n ,m n such that log (m n /a n ), log (a n /m n /) = o( v / log n), 

sup E[i? i /S , il .]l fen+1 < il .< Wn} ] < oo. 



Km. 



(A.50) 



Proof. Take m „ + 1 < A; < m n and recall the definition of E& < Sk in (16. 12f) . For n > 0, 

E[fc/E fc ] = E[fc/E fc ]l {Sfe< ^ } + E[fc/S fe ]fl {s > r , fc} < E[fc/S fe ]fl {Sfe<J)fc} + ^ 
< /cP(E fc < rjk) + r?" 1 < k¥(S k < rjk) + n~ l , 

since E^ = 1 + X^=2(^j ~~ 1) > $-k> a - s - Applying the large deviation estimate from the previous lemma, 
we obtain 

E[fc/E fe ] < t?" 1 + ke~ C2k , 

for each m n + 1 < k < fn n . Hence, 

sup E[Ri/S Ri l {lRn+1 < Ri < mn} } < rf 1 +m n e" 



i<m n 



(A.51) 



Lemma A. 6 Fix r > 3, and let m n be such that log (a n /m n ) = o(ylogn). Then, for each sequence 
C n — >• oo, 



Consequently, 



Y,B]/S]>C n )^0. 

3=2 



J2b]/s] = o v (i). 

3=2 



(A.52) 



(A.53) 



Proof. If we show that the conditional expectation of Sj=2 Bj/Sj, given {Di}f =1 , is finite, then (|A.52|) 
holds. Take a E (l,min(2,r — 2)), this is possible since r > 3. We bound 



E,. 



< 2 E n 



Si 



+ 2E r , 



<2 E n 



Si 



+ 2E n 



(A.54) 



By stochastic domination and Lemma lA.41 we find that, whp, using a > 1, 



r 1 



i=2 



< oo. 



We will now bound (|A.52|) . Although, by definition 



S j =D 1 + J2(B i -l), 



i=2 
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for the asymptotic statements that we discuss here we may as well replace this definition by 



Sj = t^Bi - 1) 



(A.55) 



i=2 



and use exchangeability, so that 



E,, 



En 



Bo - 1 \ « 



Si 



since for each j, we have Bj Q 1 = Furthermore, for j > 2, 



E, 



■/B2 - 


<E n 


[(TT] 


LI 5, J . 




A b 3 j J J 



where S3 j = (B% — 1) + • • • + (Bj — 1) . Furthermore, we can replace S3 j by S 3 • = ( X 3 — 1) + . . . + (Xj — 1) , 
which are mutually independent and sampled from D {1) — 1, ... , D {skn) — 1, as above and which are also 
independent of B 2 . Consequently, 



r«.[(¥)l^>.o=M( 



Bj-U^ 



■Bo - l\ a 



3=2 



3=2 



s, 



<E„ 



B 2 - l\o 



S 2 



+ 



— " r/ 



5=3 



— r 

l + E n [( J B 2 -l) a ]^E n [ 

3=3 



B 2 -l\ a 

1 \o- 



3=2 ' 3 



< 1 + 



EM 



3=3 



£ 2 - i\ a 



^3,i 



&,3 



(A.56) 



Finally, the expression YSj^^n Vt^f, j 



can be shown to be finite as above. ■ 
Lemma A. 7 (Logarithmic asymptotics of Sm n ) Fix r > 3, and let m n be such that log (a n /m n ) = 

(A.57) 



o( v / log n). Then, 



log S Sn - logm n = o P (v / logm n ), 
Proof. As in the previous lemma we define w.l.o.g. Sj by (IA.55j) . Then, 

Sj ^st Sj, 

where Sj is a sum of i.i.d. random variables X{ — 1, where the Xi are sampled from D\, . . . ,D n with 
replacement, where m n of the vertices with the smallest degree(s) have been removed. Using the Markov 
inequality, 



P n (^log(S™ n /mJ > c n j = ¥ n [S m Jm n > e c "J < e^E^S^/m^ = e- c "E n [A^ - 1]. (A.58) 

We shall prove below that, for r > 3, E n [Xi] — > v so that 

E„[S m ] < i/m(l + Op(l)). 
Indeed, from |24(. Proposition A. 1.1], we know that there are a, (5 > 0, such that 

P(K - f| > < ™ _/3 , 



(A.59) 



(A.60) 
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where 

OO OO XL Tl 

»n = E^r = em + i)t- e %=i + i} = r 2> (A - l} - 

-, -, - Lj n ■ , J-i n ■ , 

j=l j=l i=l i=l 

Define V n = M n [Xi\. Then we claim that there exists a,j3 > such that 

P(\V n - v n \ > n~ a ) < n- 13 . 
To see (1X621) . by definition of V n = E n [Zi], 



(A.61) 



(A.62) 



< 



=T E Ao(Ai>-i)--rEA(A-i) 

i=m n + l i=l 
n - n 

=- ^ D {i) (Z> w -i)_ £ D (i) (Z> (i) -l) 

«=m n +l l =I21 rl + 1 
^ n \ n 

- E Ai)(A.)-i)- r EA.)(Ao-i 



i=m_+l 



(A.63) 



The first term on the r.h.s. of (|A.63jl is with probability at least 1 — n 13 bounded above by n a , whp, 
since it is bounded by 

^Ea(a-i), 



i=l 



and since, using (|A.23P and (|A.24|) . (L n — L n )/L n = o P (n a ) for some a > 0. The second term on the 
r.h.s. of (|A.63j) is bounded by 



L 



i=i 



(A.64) 



since r > 3. This completes the proof of (|A.62p . Combining (|A.58|) with c n = o(^logm n ) and the fact 
that E n [Xi] — > ^, we obtain an upper bound for the left-hand side of (|A.57|) . 

For the lower bound, we simply make use of the fact that, by Lemma I A. 41 and whp, S mn > wn n , so 
that log Srn n ~ log m n > log 7? = o P ( v /logm n ). ■ 



Lemma A. 8 Fix r > 3, and Zei m „ 6e such that log (a n /m n ) = o(\/logn). Then, 



^ S 3 -{v-l)j _ v r l l 
Proof. We can stochastically bound the sum (|A.65P by 



,(y/\ogm n ). 



(A.65) 



r 1 1 1 r 1 1 1 r 1 



i i 

j L (1/^157-^ 



(A.66) 



We now proceed by proving (|A.65p both with Sj replaced by Sj, and with Sj replaced by S_j. In the proof 
of Lemma lA. 71 we have shown that E n [Xi] converges, whp, to v. Consequently, we can copy the proof of 
Proposition 14.3( a) to show that, whp, 



E^ 



S3 -iy- 1)3 



i Sj(u-i)j 



op(v^ogm^). 



(A.67) 
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Indeed, assuming that Sj > ej for all j > jo, independent of n (recall Lemma [A.4|) . we can use the bound 



V- \Sj- (y- < c y- 
H SAv-V)i ~ H 



3=30 



3=30 



\Si-(ym < c ^ + 0r{w _ Vn]lof 

3=30 



(A.68) 



where Sj = Sj — {y n — is for fixed n the sum of i.i.d. random variables with mean 0. Combining (1A.60|) 
and (|A.62p . we obtain that O r (\u — v n \ logm n ) = o P (l), so we are left to bound the first contribution in 

According to the Marcinkiewicz-Zygmund inequality (recall (|5.19p ). for a £ (1,2), 



E, 



S*\ a ] < B^^Xk - (V n - l)) 2 ]^ < B*J2E n [\X k - (u n - 1)]|°] = jB* a E n [\X 1 - {V n - l)\ a 



k=l 



k=l 



where the second inequality follows from the fact that, for < r = a/2 < 1, 

(x + y) r < (\x\ + \y\) r < \x\ r + \y\ r . 

When we take 1 < a < r — 2, where r — 2 > 1, then uniformly in n, we have that E n [|Xi — I 7 ™]] ] < c a 
because 

00 i n i n H7fn a + 1 l 

Enti^r] = y: = f - E - 1) < 7- E D ? +1 ^ 



s=l 



i=l 



i=l 



1 1 



< OO, 



since a < r — 2, so that 



r ^ |5*h ^ E^g|°]V° _ ^ {cgBiyl^U, - K ~ l)H 1/a „ 

7 "2 J — ? 2 7 -2-l/a < 00 ' 

j=l J j=l J j=l J 



(A.69) 



since a > 1, and the last bound being true a.s. and uniform in n. The proof for 5 - is identical, where 
now, instead of (LA64|) . we use that there exists a > such that, whp, 



n 

f £ c «> 



o P n 



(A.70) 



j=n— mn+l 



using the argument in (|A.23p - (|A.24p . 



B Appendix: on the deviation from a tree 

In this section, we do the necessary preliminaries needed for the proof of Proposition 14.91 in Section [7J 
One of the ingredients is writing P(C n > m) as the expectation of the product of conditional probabilities 
(see (|7,9p and Lemma IB. lj) . A second issue of Section [7J is to estimate the two error terms in (I7.15|) . We 
will deal with these two error terms in Lemma [B. 31 Lemma [B. 21 is a preparation for Lemma lB.31 and gives 
an upper bound for the expected number of artificial stubs, which in turn is bounded by the expected 
number of closed cycles. 

In the statement of the following lemma, we recall that denotes the conditional distribution given 
SWGf 25 and {A}"=i- 

Lemma B.l (Conditional product form tail probabilities C n ) 

m 

P(C n > m) = E [ 11 QW(C n > j\C n > j - 1)] . (B.l) 
3=1 
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Proof. By the tower property of conditional expectations, we can write 

P(C n > m) = E[Q«(C n > m)] = E[Q«(C n > 1)Q«(C„ > m\C n > 1)] . (B.2) 

Continuing this further, for all 1 < k < in, 

Q«(C n > m\C n >k) = E^[Q^(C n > m\C n > k)] 

= E«[Q^(C n > k + l|C n > k)Q^ +1 \C n > m\C n > k + 1)), (B.3) 

where En fc) denotes the expectation w.r.t. Qn . In particular, 

P(C„ >m) =E[Q«(C n >m)] 

= E[Q«(C n > l)E«[Q«(C n > 2|C n > l)Q[ t 2) (Cn > m\C n > 2)]] 

= E[Q«(C n > l)Qf (C n > 2|C n > l)Qf (C n > m|C n > 2)] , (B.4) 

where the last equality follows since Qn (C n > 1) is measurable w.r.t. Qn 2> and the tower property. 
Continuing this indefinitely, we arrive at (|B.1|) . ■ 

Lemma B.2 (The number of cycles closed) (a) Fix r £ (2,3). Then, whp, there exist m n with 
™nl a n — ¥ °° an d C > such that for all m < m n and all e n j. 0, 

E n [i?« -m] <e- l (— )\ i = 1,2. (B.5) 



(b) Fix t > 3. Then, there exist fn n with m n /a n — > oo and C > such that for all m < m n , 

E[i?« - m] < Cm 2 In, i = 1, 2. (B.6) 

Proof. Observe that 

m 

i?«-m<X>i' ( B - 7 ) 
i=i 

where J7j is the indicator that a cycle is closed at time j. Since closing a cycle means choosing an allowed 
stub, which occurs with conditional probability at most Sj_-^/{L n — 2j — 1), we find that 

E[Uj | Sfl v L n ] = Sf_J{L n - 2j - 1), (B.8) 

so that 

m m 

E n [i?« - m] < J2E n [Uj] = ]rE n [^V(L n " 2 J " !)]■ ( B -9) 

3=1 3=1 

When r > 3, and using that, since j < m n = o(n), we have L n — 2j — 1 > 2n — 2j — 1 > n a.s. we arrive 
at 

ELR« - m] < - V E[S<.*\] < - + - V C(j - 1) < Cm 2 /n. (B.10) 
3=1 i=2 

When r G (2, 3), we have to be a bit more careful. In this case, we apply (jA.40|) to the r.h.s. of (|B.9j) . so 
that, whp, and uniformly in m, 



E« ~m}< —e-^- T)l{T - l) = e-H-) ■ (B.ll) 



This proves (|B.5|) . 
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Lemma B.3 (Treatment of error terms) As n — ► oo, there exists m n with m n /a n — > oo, such that 



■/?? 



n 



m=l 



Proof. We start with the first term. By Lemma lB.21 for r > 3, 

E[|Art$|] < E[R® - m] < Cm 2 /n, m < m n . 

As a result, we have that 

^E[|A<|]<Cm3/n 2 = o(l). 



(B.12) 



(B.13) 



(B.14) 



Again by Lemma lB.2\ but now for r G (2, 3), whp and uniformly in m < m n , where m n is determined in 
Lemma IB.2( 

^E n [\Art^\]<^e-'=o(l), (B.15) 



n 



whenever e" 1 — ► oo sufficiently slowly. 

Using (|B.7p and |Art m | < i? m — m, and using also that, whp and for all j < m, Sm > Sj_ v we 
obtain that 



r| Art^| 

Q(2) 



Er=i ^ 



Q(2) 
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< ^E n [-§-] < ^E„[l/(L n - 2j - 1)] < m/n, 



3=1 
in 



i=l ^-1 j=l 

where we used (|B~8l) in the one-but-last inequality. 
When r > 3, we thus further obtain 



(B.16) 



1 



E E 



m=l 



so that, also using the bound on Sdn that holds whp as proved in (|7.4p . 



r |Art (2) h 1 ^ 

[I^Emll < £ J- m/n = 0(m 2 Jn 2 ), 

m=l 



1 ™" CW| Ar+^l 
- E ^) = 

771=1 " L 



When r E (2,3), by (IBIBl) . 



(B.17) 



(B.18) 



»i=i 



, m " r | Art (2)| 



m=l 



so that, again using the bound on S a „ that holds whp as proved in (|7.4j) 



(B.19) 



E = Op(r/- 1 ^~ 2+(3 - r)/( ^ 1) «n^) = O^KK) V 1 ^-!)) = 0p (i), (B.20) 

m=l 



since a n = n^ T 2 ^( T ^ and whenever m n /a n ,rj n 1 — > oo sufficiently slowly such that n 1 (m„/o„) 2 

o(l). . 
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C Appendix: weak convergence of the weight for r > 3 

In this subsection we prove Proposition 14.3( b) and Proposition 14.6( b). for r > 3. Moreover, we show 
weak convergence of C n /a n and prove (|4.29|) for r > 3. We start with Proposition 14.3( b). 
For this, we rewrite T m (compare (|4.6p . with s, replaced by Si): 



T 



1 , Ei- 1 1 1 _ s 

log m = X; -5- +[L^-^T lo g m -J ( C - !) 



i=i i=i 



The second term on the r.h.s. of (|C.ip converges a.s. to some 1" by (|5.14[) . thus, it suffices to prove that 
X^LiC^i ~~ converges a.s. For this, we use that the second moment equals, due to the independence 
of {Ei}^ and {<Si}£i and the fact that E[E { ] = Vax(Ei) = 1, 



1 Et-M 



[(E^)] = -[E^]. 



i=l ~* "i=l 



which converges uniformly in m. This shows that 



T m ~ ^-logm ^ f;^i +y, (C.3) 

^ ~~ J- . , Oi 

i=i 

which completes the proof for T m for r > 3. 

We continue the proof of Proposition 14.91 by showing that, for r > 3, (|4.25p holds: 

Lemma C.l (Weak convergence of connection time) Fix t > 3, then, 

C n /a n -±> M, (C.4) 
where M has an exponential distribution with mean n/{y — 1), i-e., 

F(M > x) = exp{ - ^— ^xj. (C.5) 

H 

Proof. The proof is somewhat sketchy, we leave the details to the reader. We again make use of the 
product structure in Lemma lB . 1 1 (recall f|B .![) )■ and simplify (|7,12p . by taking complementary probabilities, 
to 

Q<T°(C n >m+ l|C n > m) « 1 - Sj»/L n . (C.6) 

For m < m n , error terms that are left out can easily be seen to be small by Lemma lB. 31 We next simplify 
by substitution of L n = fj,n, and using that e~ x ~ 1 — x, for x small, to obtain that 

Qfr>(C n >m + l\C n > m) « exp { - Sg/Qjm)}. (C.7) 

Substituting the above approximation into (jB.ip for m = a n x yields 

S m (y - 1) 

P(C n > a n x) Efexp { — a n x— — )1 = exp { a^x}, (C8) 

L k iin JJ L /in J 

where we approximate 5 m « {y — l)m. Since a n = y/ri, we arrive at (jC.4[> — tjC.5j> . ■ 
We now complete the proof of (|4.29p for r > 3. It is not hard to prove from ([(HI) that 

(T« - 7 log a n ,Tg _ 7 log C n ) (Xi,X 2 ), (C.9) 
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where (X\,X2) are two independent random variables with distribution given by 

m 

]T l/Sf^) -logm 



X\ — / V ^ + hm 

8=1 * 



E|^ + E(5w-^)+7«, (cio) 

i=l « t=l « v y 



i=l 

where 7 (o) is the Euler-Mascheroni constant. By Lemma IC. 11 

(T£> -7 log a n ,T£ - 7 log a n ) X 2 + 7 log M) , (C.ll) 

where M is the weak limit of C n /a n defined in (I4.25|) . We conclude that 

W" n -7logn^y = Xi +X 2 + 7logM. (C.12) 

Since {v—l)M/^, is an exponential variable with mean 1, A = log ((z^ — l)M//i) has a Gumbel distribution. 

Finally let us derive the distribution of X{ . The random variables Xi are related to a random variable 
W, which appears as a limit in a supercritical continuous-time branching process as described in Section 
14.11 Indeed, denoting by Z(t) the number of alive individuals in a continuous-time branching process 
where the root has degree D having distribution function F, while all other vertices in the tree have 
degree {-Bf nd) }°^ 2 > which are i.i.d. random variables with probability mass function g in (|2.3f) . Then, W 
arises as 

Z(t)e- ( ^- 1) * ^ W. (C.13) 

We note the following general results about the limiting distributional asymptotics of continuous-time 
branching processes. 

Proposition C.2 (The limiting random variables) (a) The limiting random variable W has the fol- 
lowing explicit construction: 

D 

W = Wje-t"- 1 *' . (C.14) 

Here D has distribution F, £j are i.i.d. exponential random variables with mean one independent ofWi, 
which are independent and identically distributed with Laplace transform (f>(t) = K(e~ tw ) given by the 
formula 

<t>- 1 (x) = (l-x)ex P y^ ( ^J 1 - + j^) ds }> 0<x<l, (C.15) 

and h(-) is the probability generating function of the size-biased probability mass function g (see A2.3\) ). 
(b) Let T m be the random variables defined as 



T m = Y,Ei/Sf nd \ (C.16) 



i=l 



where E{ are i.i.d. exponential random variables with mean one, and recall that is a random walk 

where the first step has distribution D where D ~ F and the remaining increments have distribution B — l 
where B has the size biased distribution. Then 

logm g.s. log (W/(v - 1)) 

J m ; ► ; , K^- 1 ' 

v — 1 v — 1 

where W is the martingale limit in ( fC. 13\) in part(a). 

(c) The random variables Xi, i = 1,2, are i.i.d. with Xi = g ^-^X — — ■ 
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Proof. These results follow from results about continuous-time branching processes (everything relevant 
to this result is taken from [2]). Part (b) is proved in [21 Theorem 2, p. 120]. To prove part (a) recall the 
continuous-time version of the construction described in Section 14.11 where we shall let D ~ F denote 
the number of offspring of the initial root and, for i > 2, Bi ~ g, the size-biased biased probability mass 
function (|2.3f) . Then note that for any t sufficiently large we can decompose Z(t), the number of alive 
nodes at time t as 

D 

Zit)^- 1 ^ 1 = Zi(t - ^e^- 1 ^. (C.18) 

i=l 

Here D, £j and the processes Zi(-) are all independent of each other, D ~ F denotes the number of 
offspring of the root, £j are lifetimes of these offspring and are distributed as i.i.d. exponential random 
variables with mean 1, and Zj(-), corresponding to the subtrees attached below offspring j of the root, are 
independent continuous-time branching processes where each individual lives for an exponential mean 1 
amount of time and then dies, giving birth to a random number of offspring where the number of offspring 
has distribution B ~ g as in f|2.3|) . 

Now known results (see [21 Theorem 1, p. Ill and Theorem 3, p. 116]) imply that 

^(t)e-^- 1 )' ^ W U 

where Wi have Laplace transform given by (|CH5l) . Part (a) now follows by comparing (|C.14p with f|C. 18|) . 
Part(c) follows from part(b) and observing that 

1 - E ._ 1 ™ i I 
T m — -. r log m = > 7— 7T- + > j. — 7 r log m, 

(u-l) ^ S > ^ S 5 (u-l) 

y 1 1=1 u l t=l u t y ' 

and a comparison with (1C.10I) . This completes the proof. ■ 

Thus, with A a Gumbel distribution, the explicit distribution of the re-centered minimal weight paths 
is given by 

log {W x j(u - 1)) log {W 2 /(u - 1)) 

^ = + 7A-7log(i/- 1)/m, (C.19) 

^ — 1 i/ — 1 

since log M = A — log((f — l)//z). Rearranging terms establishes the claims on the limit below Theorem 
13.11 and completes the proof of (|4.29[) in Proposition I4.9f b) for r > 3. ■ 
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